ANTI-BACTERIAL METHODS AND MATERIALS

The invention relates generally to the identification of genes 
responsible for the virulence of Staphylococcus bacteria, thereby allowing the
identification of new anti-bacterial agents that target these virulence genes and
their products and the provision of novel S. aureus mutants useful in vaccines.

BACKGROUND OF THE INVENTION 

The staphylococci, of which Staphylococcus aureus is the most
important human pathogen, are hardy, gram-positive bacteria that colonize the
skin of most humans. Staphylococcal strains that produce coagulase are 
designated S. aureus; other clinically important coagulase-negative staphylococci 
are S. epidermidis and S. saprophyticus. When the skin or mucous membrane
barriers are disrupted, staphylococci can cause localized and superficial
infections that are commonly harmless and self-limiting. However, when
staphylococci invade the lymphatics and the blood, potentially serious 
complications may result, such as bacteremia, septic shock, and serious 
metastatic infections, including endocarditis, arthritis, osteomyelitis, pneumonia
and abscesses in virtually any organ. Certain strains of S. aureus produce toxins
that cause skin rashes, food poisoning, or multisystem dysfunction (as in toxic 
shock syndrome). S. aureus and S. epidermidis together have become the most 
common cause of nosocomial non-urinary tract infection in U.S. hospitals. 
They are the most frequently isolated pathogens in both primary and secondary
bacteremias and in cutaneous and surgical wound infections. See generally
Harrison's Principles of Internal Medicine, 13th ed., Isselbacher et al., eds., 
McGraw-Hill, New York (1994), particularly pages 611-617. 

Transient colonization of the nose by S. aureus is seen in 70 to 90 
percent of people, of whom 20 to 30 percent carry the bacteria for relatively
prolonged periods of time. Independent colonization of the perineal area occurs
in 5 to 20 percent of people. Higher carriage rates of S. aureus have been
documented in persons with atopic dermatitis, hospital employees, hospitalized
patients, patients whose care requires frequent puncture of the skin, and 
intravenous drug abusers. 

Infection by staphylococci usually results from a combination of
bacterial virulence factors and a diminution in host defenses. Important
microbial factors include the ability of the staphylococcus to survive under harsh 
conditions, its cell wall constituents, the production of enzymes and toxins that 
promote tissue invasion, its capacity to persist intracellularly in certain 
phagocytes, and its potential to acquire resistance to antimicrobials. Important
host factors include an intact mucocutaneous barrier, an adequate number of
functional neutrophils, and removal of foreign bodies or dead tissue. 

Cell wall components of S. aureus include a large peptidoglycan
complex that confers rigidity on the organism and enables it to survive under
unfavorable osmotic conditions, a unique teichoic acid linked to peptidoglycan,
and protein A, which is found both attached to peptidoglycan over the outermost
parts of the cell and released in soluble form. Proteins designated femA and 
femB are involved in the formation of cell wall peptidoglycan pentaglycine 
cross-bridges and are factors in methicillin resistance. [Berger-Bachi et al.,
Mol. Gen. Genet., 219:263-269 (1989).] S. aureus also has specific receptors
for laminin and fibronectin that may mediate the organism's spread through the
bloodstream to other tissues. Both peptidoglycan and teichoic acid are capable
of activating the complement cascade via the alternative pathway. S. aureus also
appears to activate tissue factor in the coagulation pathway. 

Certain enzymes produced by S. aureus may play a role in
virulence. Catalase degrades hydrogen peroxide and may protect the organism
during phagocytosis. Coagulase is present in both soluble and cell-bound forms 
and causes plasma to clot by formation of thrombin-like material. The high 
correlation between coagulase production and virulence suggests that this 
substance is important in the pathogenesis of staphylococcal infections, but its
precise role as a determinant of pathogenicity has not been determined. Many
strains also produce hyaluronidase, an enzyme that degrades hyaluronic acid in
the connective tissue matrix and that may promote spreading of infection. A
trypsin-like protease from some strains enhances influenza virus infection by 
proteolytic cleavage of the viral precursor hemagglutinin into its active 
fragments and may contribute to the morbidity of such coinfections. 

S. aureus produces numerous extracellular exotoxins that have
been implicated in disease processes. The exfoliatin toxins A and B, the
staphylococcal enterotoxins, and the toxic shock syndrome toxin, TSST-1, 
belong to the growing family of microbial superantigens that activate T cells and 
monocytes/macrophages, resulting in the production of cytokines that mediate
local or systemic effects depending on the amount of toxin formed, the immune
status of the host, and the access of the toxin to the circulation. The exfoliatin
toxins mediate the dermatologic manifestations of the staphylococcal scalded-skin
syndrome and bullous impetigo. These toxins cause intraepidermal
cleavage of the skin at the stratum granulosum, leading to bullae formation and
denudation. Seven distinct enterotoxins (A, B, C1, C2, C3, D, and E) have
been implicated in food poisoning due to S. aureus. These toxins enhance 
intestinal peristalsis and appear to induce vomiting by a direct effect on the 
central nervous system. Toxic shock syndrome (TSS) is most commonly 
mediated by TSST-1, which is present in 5 to 25 percent of clinical isolates of S.
aureus. TSS is also mediated less frequently by enterotoxin B and, rarely,
enterotoxin C1.

S. aureus produces other toxins whose role in virulence is 
incompletely understood. Four different red blood cell hemolysins, which are 
designated alpha, beta, gamma, and delta toxins, have been identified. Alpha
toxin also causes necrosis of the skin when injected subcutaneously into animals,
while delta toxin also inhibits water absorption in the intestines and may play a 
role in the acute watery diarrhea seen in some cases of staphylococcal infection. 
Leukocidin lyses granulocyte and macrophage membranes by producing 
membrane pores permeable to cations. 

The agr, xpr, sae and sar genes have been identified as being
involved in the regulation of staphylococcal exotoxins. See U.S. Patent No.
5,587,228 and International Patent Publication Nos. WO 96/10579 and WO
97/11690. Of interest is the report in WO 97/11690 of screening for inhibitors 
of these regulatory systems. 

Staphylococci can invade the skin or mucosa through plugged hair
follicles and sebaceous glands or areas traumatized by burns, wounds, abrasions,
insect bites, or dermatitis. Staphylococci often colonize prosthetic devices and 
intravenous catheters; S. aureus infection of the vascular access site is a major 
cause of morbidity and death among patients on hemodialysis. Colonization and
invasion of the lungs may occur with endotracheal intubation, or when the
lungs' clearance mechanisms are depressed, e.g., after viral infections, after
aspiration, or in patients with cystic fibrosis. Mucosal damage to the 
gastrointestinal tract following cytotoxic chemotherapy or radiotherapy 
predisposes to invasion from that site. 

Once the skin or mucosa has been breached, local bacterial
multiplication is accompanied by inflammation, neutrophil accumulation, tissue
necrosis, thrombosis and fibrin deposition at the site of infection. Later, 
fibroblasts create a relatively avascular wall about the area. When host 
mechanisms fail to contain the cutaneous or submucosal infection, staphylococci 
may enter the lymphatics and the bloodstream. Common sites of metastatic
spread include the lungs, kidneys, cardiac valves, myocardium, liver, spleen,
bones and brain. 

Bacteremia due to S. aureus may arise from any local infection, at 
either extravascular (cutaneous infections, burns, cellulitis, osteomyelitis, 
arthritis) or intravascular foci (intravenous catheters, dialysis access sites,
intravenous drug abuse). Rarely, patients with bacteremia die within 12 to 24
hours with high fever, tachycardia, cyanosis, and vascular collapse. 
Disseminated intravascular coagulation may produce a disease mimicking 
meningococcemia. Commonly, the disease progresses more slowly, with hectic 
fever and metastatic abscess formation. 

A major complication of S. aureus bacteremia is endocarditis. S.
aureus is the second most common cause of endocarditis and the most common
cause among drug addicts. The disease is typically acute, with high fever,
progressive anemia, and frequent embolic and extracardiac septic complications. 
Valve ring and myocardial abscesses are common. The mortality rate is 20 to 
30 percent. 

Staphylococcal scalded-skin syndrome (SSSS) is a generalized
exfoliative dermatitis that is a complication of infection by exfoliatin
toxin-producing strains of S. aureus. The disease typically occurs in newborns
(Ritter's disease) and in children under the age of five. A scarlatiniform rash 
begins in the perioral area, becomes generalized over the trunk and extremities,
and finally desquamates. The disease may consist of rash alone (staphylococcal
scarlet fever), or large, flaccid bullae develop that may be localized (more 
common in adults) or generalized. The bullae burst, resulting in red, denuded 
skin resembling a burn. Most adults with SSSS are immunosuppressed or have 
renal insufficiency. Blood cultures are frequently positive, and mortality is
significant.

Toxic shock syndrome (TSS) is a multisystem disease mediated 
by toxins (generally TSST-1, and less frequently enterotoxins B and C1)
produced by certain strains of S. aureus. It was first described in children, but
in 1980 became epidemic among young women, with onset during menstruation.
The diagnosis of TSS is based on clinical criteria that include high fever, a
diffuse rash that desquamates on the palms and soles over the subsequent one or 
two weeks, hypotension that may be orthostatic, and evidence of involvement in 
three or more organ systems. Such involvement commonly includes 
gastrointestinal dysfunction (vomiting or diarrhea), renal or hepatic
insufficiency, mucous membrane hyperemia, thrombocytopenia, myalgias with
elevated creatine phosphokinase (CK) levels, and disorientation with a normal 
cerebrospinal fluid examination. The mortality rate of TSS is three percent. 

S. aureus causes approximately three percent of community-acquired
bacterial pneumonias. This disease occurs sporadically except during
influenza outbreaks, when staphylococcal pneumonia is relatively more
common, although still less frequent than pneumococcal pneumonia. Primary
staphylococcal pneumonia in infants and children frequently presents with high
fever and cough. Multiple thin-walled abscesses are seen on the chest X-ray, 
and empyema formation is common. In older children and healthy adults, 
staphylococcal pneumonia is generally preceded by an influenza-like respiratory 
infection. Onset of staphylococcal involvement is abrupt, with chills, high
fever, progressive dyspnea, cyanosis, cough, pleural pain, and sometimes 
bloody sputum. Staphylococcal pneumonia is seen more frequently in patients 
with cystic fibrosis, in intubated patients in intensive care units and in debilitated 
patients who are prone to aspiration.

S. aureus is responsible for the majority of cases of acute
osteomyelitis. Although the disease is most common in people under the age of 
20, it is becoming increasingly prevalent in adults over 50, particularly with 
involvement of the spine. A primary portal of entry is frequently not identified, 
although many patients give a history of preceding trauma to the involved area. 
Once established, infection spreads through the bone to the periosteum or along
the marrow cavity. Rarely, the joint capsule is penetrated, producing pyogenic 
arthritis. Osteomyelitis in children may present as an acute process beginning 
abruptly with chills, high fever, nausea, vomiting, and progressive pain at the 
site of bony involvement.

S. aureus causes 1 to 9 percent of cases of bacterial meningitis
and 10 to 15 percent of brain abscesses. Most commonly, the bacteria are 
spread from a focus outside the central nervous system, typically from infective 
endocarditis, by extension from a paraspinal or parameningeal abscess, or by 
nosocomial infection following neurosurgical procedures. Over 50 percent of 
epidural abscesses are due to S. aureus; up to half of these cases may be
associated with vertebral osteomyelitis. Patients present with either acute or 
chronic back pain, usually with low-grade fever and malaise. The onset of 
radicular pain is an ominous sign that the disease may progress to neurologic 
dysfunction and ultimate paralysis.

Antimicrobial resistance by staphylococci favors their persistence
in the hospital environment. Over 90 percent of both hospital and community
strains of S. aureus causing infection are resistant to penicillin. This resistance
is due to the production of β-lactamase enzymes; the genes for these enzymes
are usually carried by plasmids. Infections due to organisms with such acquired
resistance can sometimes be treated with penicillinase-resistant β-lactam
antimicrobial agents. However, the true penicillinase-resistant S. aureus
organisms, called methicillin-resistant S. aureus (MRSA), are resistant to all the
β-lactam antimicrobials as well as the cephalosporins. MRSA resistance is
chromosomally mediated and involves production of an altered penicillin-binding
protein (PBP 2a or PBP 2') with a low binding affinity for β-lactams. MRSA
frequently also have acquired plasmids mediating resistance to erythromycin,
tetracycline, chloramphenicol, clindamycin, and aminoglycosides. MRSA have 
become increasingly common worldwide, particularly in tertiary-care referral 
hospitals. In the United States, approximately 5 percent of hospital isolates of 
S. aureus are methicillin-resistant. 

Thus, there continues to exist a need for new agents useful for
treating bacterial infections, particularly those caused by antibiotic-resistant
bacteria, and for methods of identifying such new agents. Such methods ideally 
would identify agents that are unrelated to existing antimicrobials and that target 
different aspects of staphylococcal invasion of and replication in the host,
compared to existing antimicrobials.

SUMMARY OF THE INVENTION 

The present invention relates generally to the identification of 
genes responsible for the virulence of Staphylococcus bacteria, thereby allowing 
the identification of new anti-bacterial agents that target these virulence genes
and their products and the provision of novel S. aureus mutants useful in 
vaccines. 

According to one aspect of the present invention, methods are 
provided for identifying anti-bacterial agents that target the function of
staphylococcal virulence genes or gene products. Such methods include
assaying potential agents for the ability to interfere with expression of virulence
gene products represented by the DNA sequences set forth in any one of SEQ
ID NOS: 1 through 94, or assaying potential agents for the ability to interfere 
with the function of a bacterial protein encoded in whole or in part by a DNA 
sequence set forth in any one of SEQ ID NOS: 1 through 94 or the 
complementary strand thereof, followed by identifying agents that are positive in
such assays. 

The use of a number of different assays is contemplated according 
to this aspect of the invention. When the function of the virulence gene product 
is known or predicted by sequence similarity to a known gene product, potential
inhibitors can be screened in enzymatic or other types of assays keyed to the
function of the gene product. When the virulence gene product is known or 
predicted by sequence similarity to a known gene product to interact with 
another protein or nucleic acid, inhibitors of this interaction can be screened 
directly in binding assays or using the two-hybrid assay. Other assays may be
used when a ligand for the virulence gene product is not known, including
two-hybrid screening assays that identify gene products that interact with target
protein, assays that identify ligands of target protein through measuring of direct 
binding of test ligand to target protein, and assays that identify ligands of target 
proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC
methods or other physical and analytical methods.

In another aspect of this invention, methods are provided for 
assaying potential agents for the ability to interfere with expression of or
function of virulence gene products, wherein the virulence genes encoding these
products are obtainable by identification through signature-tagged mutagenesis
as defined herein and exemplified in Example 1.

According to a further aspect of this invention, novel anti- 
bacterial agents identified by the methods described herein are provided, as well 
as methods for treating a subject suffering from infection with staphylococci 
involving administration of such novel anti-bacterial agents. In particular,
agents that interfere with the expression of virulence gene products include
anti-sense polynucleotides that are complementary to the virulence gene sequences.
Agents that interfere with the function of virulence gene products include
variants of virulence gene products, ligands of these virulence gene products and 
variants thereof, and enzyme inhibitors (where the product is an enzyme). 

Yet a further aspect of this invention provides Staphylococcus
aureus organisms containing a functional mutation in a gene represented by any
one of SEQ ID NOS: 1 through 94, said functional mutation resulting in a 
reduction in virulence of the organism. Also contemplated are vaccine 
compositions comprising such mutated S. aureus organisms, optionally 
comprising a suitable adjuvant and a pharmaceutically acceptable diluent or
carrier.

Numerous additional aspects and advantages of the invention will 
become apparent to those skilled in the art upon consideration of the following 
detailed description of the invention which describes presently preferred
embodiments thereof. 

DETAILED DESCRIPTION OF THE INVENTION 

"Virulence genes," as used herein, are genes whose function or 
products are required for successful establishment and/or maintenance of 
bacterial infection in a host animal. Thus, virulence genes and/or the proteins
encoded thereby are involved in pathogenesis in the host organism, but may not
be necessary for growth in vitro. Since antibiotics are typically screened in 
vitro, identification of these in vivo virulence genes provides a means for 
discovering new antimicrobials with different targets and mechanisms of action 
compared to existing antibiotics. There may be 50 to 100 virulence genes in S.
aureus [see Groisman and Ochman, Trends Microbiol. Sci., 2:289-294 (1994)
(discussing Salmonella virulence genes); Muhldorfer and Hacker, Microb.
Pathogenesis, 16:171-181 (1994) (discussing E. coli virulence genes)].

"Signature-tagged mutagenesis," as used herein, is a method 
generally described in International Patent Publication No. WO 96/17951,
incorporated herein by reference, and includes, for example, a method for
identifying S. aureus genes required for virulence in a murine model of
bacteremia. In this method, each insertional mutation carries a different DNA
signature tag which allows mutants to be differentiated from each other. The 
tags comprise 40-bp variable central regions flanked by invariant "arms" of 
20 bp, which allow the central portions to be co-amplified by polymerase chain
reaction (PCR). Tagged mutant strains are assembled in microtitre dishes, then
combined to form the "inoculum pool" for infection studies. At an appropriate 
time after inoculation, bacteria are isolated from the animal and pooled to form 
the "recovered pool." The tags in the recovered pool and the tags in the 
inoculum pool are separately amplified, labelled, and then used to probe filters
arrayed with the different tags representing the mutants in the inoculum.
Mutants with attenuated virulence are those with tags that give hybridization 
signals when probed with tags from the inoculum pool but not when probed with 
tags from the recovered pool. 
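
By way of illustration only, the comparison of hybridization signals described above can be sketched as follows. The numeric signal scale, the detection threshold of 0.5, the tag names, and the function name are assumptions introduced for this sketch and are not part of the described method.

```python
# Illustrative sketch of scoring a signature-tagged mutagenesis filter.
# Assumption: each tag has a normalized hybridization signal in [0, 1]
# for the inoculum-pool probe and for the recovered-pool probe.

def attenuated_mutants(inoculum_signal, recovered_signal, threshold=0.5):
    """Return tags detected with the inoculum probe but not the recovered probe."""
    attenuated = []
    for tag, in_sig in inoculum_signal.items():
        out_sig = recovered_signal.get(tag, 0.0)
        # A tag must be clearly present in the inoculum to be scored at all;
        # tags that hybridize poorly in both pools are uninformative.
        if in_sig >= threshold and out_sig < threshold:
            attenuated.append(tag)
    return sorted(attenuated)


# Hypothetical signals for four tagged mutants.
inoculum = {"tag01": 0.90, "tag02": 0.80, "tag03": 0.10, "tag04": 0.95}
recovered = {"tag01": 0.85, "tag02": 0.05, "tag03": 0.00, "tag04": 0.70}

candidates = attenuated_mutants(inoculum, recovered)
print(candidates)
```

In this sketch, a tag that gives a weak signal with both probes (tag03) is not scored as attenuated, because its absence from the recovered pool cannot be distinguished from poor amplification or labelling.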

Signature-tagged mutagenesis allows a large number of insertional
mutant strains to be screened simultaneously in a single animal for loss of
virulence. Screening thirteen pools of 96 mutant S. aureus strains resulted in 
the identification of 50 strains with reduced virulence, many of which were 
confirmed to be attenuated in virulence by subsequent analysis of individual 
mutants. The nucleotide sequences of the regions flanking the transposon
insertion points of these mutants were analyzed by searching DNA and protein
sequence databases to identify the genes inactivated by the insertion of the 
transposon. 

On the basis of these searches many of the virulence genes may 
be grouped into different classes. The first class encodes proteins involved in
cell surface metabolism (e.g., P2C73, P11C29, P13C83, P9B65, P10B89).
Both femA and femB, which are involved in the formation of cell wall
peptidoglycan pentaglycine cross bridges, were identified as virulence genes. 
Mutant P2C73 contains a transposon insertion in a previously unknown gene 
that shares significant similarity to femB. Mutant P14C15 contains a transposon
insertion in a gene whose product is similar to aspartate semialdehyde
dehydrogenase (Asd) from different bacteria, with the highest level of similarity
to Asd from Bacillus subtilis. Asd is a key enzyme in the biosynthesis of
methionine, threonine, isoleucine, lysine and diaminopimelic acid (DAP), which 
is an important component of cell wall peptidoglycan. 

The second class encodes enzymes involved in cellular
biosynthetic pathways (e.g., P9B74, P5C4, P9B66, P14C15, P13B26, P7C18,
P15C31, P10B18, P6B18, P10B66, P10C34, P12C3). Deduced protein 
products of two genes (mutants P7C18 and P13B26) show strong similarity to B. 
subtilis LysA and ThrB. These enzymes, like Asd, are involved in aspartate 
metabolism. LysA is diaminopimelate decarboxylase, which converts
diaminopimelate to lysine, and ThrB phosphorylates homoserine before
conversion into threonine. Transposon insertions were also obtained in genes
homologous to Methanococcus jannaschii trpA, Lactococcus lactis trpB and L.
lactis trpD. These genes encode enzymes of the tryptophan biosynthetic
pathway: the alpha chain of tryptophan synthetase, the beta chain of tryptophan
synthetase, and anthranilate phosphoribosyltransferase, respectively. The gene
mutated in P15C31 is a homolog of L. lactis purL encoding 
phosphoribosylformylglycinamidine decarboxylase, an enzyme of the purine 
biosynthetic pathway. Mutant P9B66 contains an insertion in a gene whose 
product is similar to peptide methionine sulphoxide reductases. 

A third class of genes are those encoding components of the TCA
cycle (e.g., mutants P4C27, P4C52, P10B2, P10C20, P12C32). Strains P10B2
and P12C32 carry mutations in genes for a subunit of the oxoglutarate 
dehydrogenase complex and aconitase, respectively. 

The fourth class is composed of genes whose products are similar
to oligopeptide transport proteins of the ATP-binding cassette (ABC)
transporter superfamily (e.g., mutants P7C26, P10C15, P5C3, P11C66, 
P5C34). Oligopeptide transport is important for peptide utilization and the 
proteolytic system in lactococci. In Group A streptococci, Opp proteins are 
involved not only in obtaining nutrients but also in adherence, protease
production and processing of secreted proteins.

The fifth class of genes is involved in cellular regulatory and
repair processes (e.g., mutants P4C15, P13B74, P13C72, P10B30, P6C63,
P14B25). Mutants P4C15 and P6C63 contain insertions in S. aureus
MarR/LuxR-like regulatory proteins. MarR and LuxR belong to a family of
transcription regulators and these MarR/LuxR-like proteins likely have a similar
function in S. aureus. In Streptococcus pneumoniae, Neisseria gonorrhoeae and 
Escherichia coli this enzyme helps to maintain surface adhesins in their 
functional oxidative state. Mutant P10B30 is associated with a transposon
insertion in a gene with a product similar to the ATP-dependent Clp protease of
E. coli. The Clp stress response system for intracellular protein degradation is
widely conserved in bacteria and components of the system are important for
virulence of Listeria monocytogenes and S. typhimurium. Mutants P13B74 and
P13C72 have stem-loop termination sequences which possibly function in 
transcription termination. 

The identification of these genes as virulence genes renders them
useful in methods of identifying new anti-bacterial agents according to the
present invention. Such methods include assaying potential agents for the ability 
to interfere with expression of virulence gene products represented by the DNA 
sequences set forth in any one of SEQ ID NOS: 1 through 94 (i.e., the genes
represented by DNA sequences of SEQ ID NOS: 1 through 94 encode the
virulence gene product, or the DNA sequences of SEQ ID NOS: 1 through 94 
are adjacent to the gene encoding the virulence gene product, or are involved in 
regulation of expression of the virulence gene product), or assaying potential 
agents for the ability to interfere with the function of a bacterial protein encoded
in whole or in part by a DNA sequence set forth in any one of SEQ ID NOS: 1
through 94 or the complementary strand thereof, followed by identifying agents 
that are positive in such assays. Polynucleotides and polypeptides useful in 
these assays include not only the genes and encoded polypeptides as disclosed 
herein, but also variants thereof that have substantially the same activity as the
wild-type genes and polypeptides. "Variants," as used herein, includes
polynucleotides or polypeptides which contain one or more deletions, insertions
or substitutions, as long as the variant retains substantially the same activity as
the wild-type polynucleotide or polypeptide. With regard to polypeptides, 
deletion variants are contemplated to include fragments lacking portions of the 
polypeptide not essential for biological activity, and insertion variants are
contemplated to include fusion polypeptides in which the wild-type polypeptide
or fragment thereof has been fused to another polypeptide.

The virulence genes may be cloned by PCR, using S. aureus 
genomic DNA as the template. For ease of inserting the gene into expression 
vectors, PCR primers are chosen so that the PCR-amplified gene has a
restriction enzyme site at the 5' end preceding the initiation codon ATG, and a
restriction enzyme site at the 3' end after the termination codon TAG, TGA or
TAA. If desirable, the codons in the gene are changed, without changing the
amino acids, according to E. coli codon preference described by Grosjean and
Fiers, Gene, 18:199-209 (1982), and Konigsberg and Godson, Proc. Natl. Acad.
Sci. (USA), 80:687-691 (1983). Optimization of codon usage may lead to an
increase in the expression of the gene product when produced in E. coli. If the 
gene product is to be produced extracellularly, either in the periplasm of E. coli 
or other bacteria, or into the cell culture medium, the gene is cloned without its 
initiation codon and placed into an expression vector behind a signal sequence.
For example, cloning and expression of the femA gene is described in Example 3
below. 
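
As a non-limiting sketch of changing codons without changing the encoded amino acids, the following replaces each codon with a preferred synonymous codon and checks that the translation is unchanged. The small preference table and the example sequence are hypothetical placeholders chosen for illustration; they are not taken from Grosjean and Fiers or from Konigsberg and Godson, and a real application would substitute a published E. coli codon usage table.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical, partial preference table (assumption for illustration only).
PREFERRED = {"L": "CTG", "R": "CGT", "G": "GGT", "P": "CCG",
             "A": "GCG", "K": "AAA", "E": "GAA"}

# Sanity check: every preferred codon must encode the amino acid it is listed for.
assert all(CODON_TABLE[codon] == aa for aa, codon in PREFERRED.items())


def split_codons(dna):
    """Split an in-frame coding sequence into codons."""
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def translate(dna):
    """Translate a coding sequence; '*' marks a termination codon."""
    return "".join(CODON_TABLE[codon] for codon in split_codons(dna))


def recode(dna, preferred=PREFERRED):
    """Swap each codon for the preferred synonymous codon, if one is listed."""
    return "".join(preferred.get(CODON_TABLE[codon], codon)
                   for codon in split_codons(dna))


original = "ATGTTAAGAGGAAAGTAA"   # hypothetical sequence: M L R G K stop
recoded = recode(original)
print(recoded, translate(original) == translate(recoded))
```

Codons whose amino acid has no entry in the preference table, including the initiation and termination codons here, are left unchanged, so the recoded sequence always encodes the same polypeptide.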

To simplify the protein purification process, a purification tag 
may be added either at the 5' or 3' end of the gene coding sequence. Commonly
used purification tags include a stretch of six histidine residues (U.S. Patent
Nos. 5,284,933 and 5,310,663), a streptavidin-affinity tag described by Schmidt
and Skerra, Protein Engineering, 6:109-122 (1993), a FLAG peptide [Hopp et
al., Biotechnology, 6:1205-1210 (1988)], glutathione S-transferase [Smith and
Johnson, Gene, 67:31-40 (1988)], and thioredoxin [LaVallie et al.,
Bio/Technology, 11:187-193 (1993)]. To remove these peptides or polypeptides,
a proteolytic cleavage recognition site may be inserted at the fusion junction.
Commonly used proteases are factor Xa, thrombin, and enterokinase.

Proteins are produced in any number of well-known prokaryotic
or eukaryotic expression systems using known promoters, vectors, and hosts. 
Any suitable host cell may be used for expression of the gene product, such as 
E. coli, other bacteria, including Bacillus and S. aureus, yeast, including Pichia
pastoris and Saccharomyces cerevisiae, insect cells, or mammalian cells,
including CHO cells, utilizing suitable vectors known in the art. Proteins may 
be produced directly or fused to a peptide or polypeptide, and either 
intracellularly or extracellularly by secretion into the periplasmic space of a 
bacterial cell or into the cell culture medium. Secretion of a protein requires a
signal peptide (also known as pre-sequence); a number of signal sequences from
prokaryotes and eukaryotes are known to function for the secretion of
recombinant proteins. During the protein secretion process, the signal peptide is
removed by signal peptidase to yield the mature protein. 

The virulence gene products produced by the methods described
above are used in high throughput assays to screen for inhibitory agents. The
sources for potential agents to be screened are chemical compound libraries,
fermentation media of Streptomycetes, other bacteria and fungi, and cell extracts
of plants and other vegetation. For proteins with known enzymatic activity,
assays are established based on the activity, and a large number of potential
agents are screened for ability to inhibit the activity. For proteins that interact
with another protein or nucleic acid, binding assays are established to measure 
such interaction directly, and the potential agents are screened for ability to 
inhibit the binding interaction. 
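
One way such screening data might be scored is sketched below, purely as an illustration. The percent-inhibition formula, the 50 percent cutoff, the background correction, and all names and readings are assumptions made for this sketch; the text above does not prescribe any particular scoring scheme.

```python
# Illustrative scoring of an activity-based or binding-based screen.
# Assumption: each well yields one signal proportional to enzymatic
# activity (or to binding), alongside an uninhibited control and a
# background control measured on the same plate.

def percent_inhibition(signal_with_agent, uninhibited_signal, background_signal=0.0):
    """Percent reduction in signal relative to the uninhibited control."""
    window = uninhibited_signal - background_signal
    if window <= 0:
        raise ValueError("uninhibited control must exceed background")
    return 100.0 * (1.0 - (signal_with_agent - background_signal) / window)


def select_hits(readings, uninhibited_signal, background_signal=0.0, cutoff=50.0):
    """Return names of agents whose inhibition meets or exceeds the cutoff."""
    return sorted(
        name for name, signal in readings.items()
        if percent_inhibition(signal, uninhibited_signal, background_signal) >= cutoff
    )


# Hypothetical plate readings for three test agents.
readings = {"agent_A": 95.0, "agent_B": 30.0, "agent_C": 10.0}
hits = select_hits(readings, uninhibited_signal=100.0)
print(hits)
```

Agents selected this way would ordinarily be retested individually, since a reduced signal in a single well can also arise from interference with the detection method rather than from inhibition of the target.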

Alternatively, such binding interactions are evaluated indirectly
using the yeast two-hybrid system described in Fields and Song, Nature,
340:245-246 (1989), and Fields and Sternglanz, Trends in Genetics, 10:286-292
(1994), both of which are incorporated herein by reference. The two-hybrid 
system is a genetic assay for detecting interactions between two proteins or 
polypeptides. It can be used to identify proteins that bind to a known protein of
interest, or to delineate domains or residues critical for an interaction.
Variations on this methodology have been developed to clone genes that encode
DNA-binding proteins, to identify peptides that bind to a protein, and to screen
for drugs. The two-hybrid system exploits the ability of a pair of interacting 
proteins to bring a transcription activation domain into close proximity with a 
DNA-binding domain that binds to an upstream activation sequence (UAS) of a 

5 reporter gene, and is generally performed in yeast. The assay requires the 
construction of two hybrid genes encoding (1) a DNA-binding domain that is 
fused to a protein X, and (2) an activation domain fused to a protein Y. The 
DNA-binding domain targets the first hybrid protein to the UAS of the reporter 
gene; however, because most proteins lack an activation domain, this DNA- 

10 binding hybrid protein does not activate transcription of the reporter gene. The 
second hybrid protein, which contains the activation domain, cannot by itself 
activate expression of the reporter gene because it does not bind the UAS. 
However, when both hybrid proteins are present, the noncovalent interaction of 
protein X and protein Y tethers the activation domain to the UAS, activating 

15 transcription of the reporter gene. When the virulence gene product (protein X, 
for example) is already known to interact with another protein or nucleic acid 
(protein Y, for example), this assay can be used to detect agents that interfere 
with the interaction of X and Y. Expression of the reporter gene is monitored 
as different test agents are added to the system; the presence of an inhibitory 

20 agent results in lack of a reporter signal. 
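
By way of illustration only, the logic of the two-hybrid readout described above may be sketched as a simple truth function. The function name and its arguments are hypothetical and are not part of any published protocol; the sketch assumes an idealized system in which an inhibitory agent completely blocks the interaction of X and Y.

```python
def reporter_expressed(dbd_x_present, ad_y_present, x_binds_y,
                       inhibitor_present=False):
    """Idealized two-hybrid readout (illustrative assumption only).

    dbd_x_present     -- hybrid of DNA-binding domain and protein X is expressed
    ad_y_present      -- hybrid of activation domain and protein Y is expressed
    x_binds_y         -- proteins X and Y interact noncovalently
    inhibitor_present -- a test agent that blocks the X-Y interaction is added
    """
    # The activation domain reaches the UAS only through the X-Y interaction.
    interaction_intact = x_binds_y and not inhibitor_present
    return dbd_x_present and ad_y_present and interaction_intact


# Either hybrid alone gives no signal; both together give a signal,
# and an inhibitory agent abolishes it.
print(reporter_expressed(True, False, True))                         # False
print(reporter_expressed(True, True, True))                          # True
print(reporter_expressed(True, True, True, inhibitor_present=True))  # False
```

In a drug screen, loss of the reporter signal in the presence of a test agent is therefore the event of interest.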

When the function of the virulence gene product is unknown and
no ligands are known to bind the gene product, the yeast two-hybrid assay can
also be used to identify proteins that bind to the gene product. In an assay to
identify proteins that bind to protein X (the target protein), a large number of
hybrid genes each containing a different protein Y are produced and screened in
the assay. Typically, Y is encoded by a pool of plasmids in which total cDNA
or genomic DNA is ligated to the activation domain. This system is applicable
to a wide variety of proteins, and it is not even necessary to know the identity or
function of protein Y. The system is highly sensitive and can detect interactions
not revealed by other methods; even transient interactions may trigger
transcription to produce a stable mRNA that can be repeatedly translated to yield
the reporter protein.

Other assays may be used to search for agents that bind to the
target protein. One such screening method to identify direct binding of test
ligands to a target protein is described in U.S. Patent No. 5,585,277,
incorporated herein by reference. This method relies on the principle that
proteins generally exist as a mixture of folded and unfolded states, and
continually alternate between the two states. When a test ligand binds to the
folded form of a target protein (i.e., when the test ligand is a ligand of the target
protein), the target protein molecule bound by the ligand remains in its folded
state. Thus, the folded target protein is present to a greater extent in the
presence of a test ligand which binds the target protein, than in the absence of a
ligand. Binding of the ligand to the target protein can be determined by any
method which distinguishes between the folded and unfolded states of the target
protein. The function of the target protein need not be known in order for this
assay to be performed. Virtually any agent can be assessed by this method as a
test ligand, including, but not limited to, metals, polypeptides, proteins, lipids,
polysaccharides, polynucleotides and small organic molecules. For example,
use of femA in this method of screening for potential ligands is described in
Example 3 below.
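
The principle stated above can be illustrated with a minimal two-state model. This is a sketch under stated assumptions, not the method of the cited patent: the protein is assumed to interconvert between one folded and one unfolded state with equilibrium constant k_fold = [folded]/[unfolded], and the ligand is assumed to bind only the folded state with dissociation constant k_d. All names and numerical values are hypothetical.

```python
def fraction_folded(k_fold, ligand_conc, k_d):
    """Fraction of target protein that is folded (free or ligand-bound).

    k_fold      -- [folded]/[unfolded] in the absence of ligand (assumed)
    ligand_conc -- free ligand concentration, same units as k_d (assumed)
    k_d         -- dissociation constant of ligand for the folded form (assumed)
    """
    # Ligand binding to the folded form raises the apparent folding constant.
    apparent_k = k_fold * (1.0 + ligand_conc / k_d)
    return apparent_k / (1.0 + apparent_k)


# Hypothetical marginally stable protein: half folded without ligand.
for ligand in (0.0, 1.0, 10.0):
    print(ligand, round(fraction_folded(1.0, ligand, 1.0), 3))
```

Under these assumptions the folded fraction rises monotonically with ligand concentration, which is the shift that any assay distinguishing folded from unfolded protein would detect.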

Another method for identifying ligands for a target protein is
described in Wieboldt et al., Anal. Chem., 69:1683-1691 (1997), incorporated
herein by reference. This technique screens combinatorial libraries of 20-30
agents at a time in solution phase for binding to the target protein. Agents that
bind to the target protein are separated from other library components by
centrifugal ultrafiltration. The specifically selected molecules that are retained
on the filter are subsequently liberated from the target protein and analyzed by
HPLC and pneumatically assisted electrospray (ion spray) ionization mass
spectroscopy. This procedure selects library components with the greatest
affinity for the target protein, and is particularly useful for small molecule
libraries.

The inhibitors/binders identified by the initial screens are
evaluated for their effect on virulence in in vivo mouse models of S. aureus
infections. Models of bacteremia, endocarditis, septic arthritis, soft tissue
abscess or pneumonia may be utilized. Inhibitors/binders that interfere with
bacterial virulence are capable of preventing the establishment of an infection or
reversing the outcome of an infection once it is established.

The identification of S. aureus virulence genes also provides for
microorganisms exhibiting reduced virulence, which are useful in vaccines.
Such microorganisms include the S. aureus mutants generated in Example 1
below and other S. aureus mutants containing at least one functional mutation in
a gene represented by any one of SEQ ID NOS: 1 through 94. The reduced
virulence of these organisms and their immunogenicity may be confirmed by
administration to a subject. While it is possible for an avirulent microorganism
of the invention to be administered alone, one or more of such mutant
microorganisms are preferably administered in a vaccine composition containing
suitable adjuvant(s) and pharmaceutically acceptable diluent(s) or carrier(s).
The carrier(s) must be "acceptable" in the sense of being compatible with the
avirulent microorganism of the invention and not deleterious to the subject to be
immunized. Typically, the carriers will be water or saline which will be sterile
and pyrogen free. The subject to be immunized is a subject needing protection
from a disease caused by a virulent form of S. aureus.

Any adjuvant known in the art may be used in the vaccine
composition, including oil-based adjuvants such as Freund's Complete Adjuvant
and Freund's Incomplete Adjuvant, mycolate-based adjuvants (e.g., trehalose
dimycolate), bacterial lipopolysaccharide (LPS), peptidoglycans (i.e., mureins,
mucopeptides, or glycoproteins such as N-Opaca, muramyl dipeptide [MDP], or
MDP analogs), proteoglycans (e.g., extracted from Klebsiella pneumoniae),
streptococcal preparations (e.g., OK432), Biostim™ (e.g., 01K2), the "Iscoms"
of EP 109 942, EP 180 564 and EP 231 039, aluminum hydroxide, saponin,
DEAE-dextran, neutral oils (such as miglyol), vegetable oils (such as arachis
oil), liposomes, Pluronic® polyols or the Ribi adjuvant system (see, for example,
GB-A-2 189 141). Recently, an alternative adjuvant consisting of extracts of
Amycolata, a bacterial genus in the order Actinomycetales, has been described
in U.S. Patent No. 4,877,612. Additionally, proprietary adjuvant mixtures are
commercially available. The adjuvant used will depend, in part, on the recipient
organism. The amount of adjuvant to administer will depend on the type and
size of animal. Optimal dosages may be readily determined by routine methods.

The vaccine compositions optionally may include
pharmaceutically acceptable (i.e., sterile and non-toxic) liquid, semisolid, or
solid diluents that serve as pharmaceutical vehicles, excipients, or media. Any
diluent known in the art may be used. Exemplary diluents include, but are not
limited to, polyoxyethylene sorbitan monolaurate, magnesium stearate, methyl-
and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, dextrose,
sorbitol, mannitol, gum acacia, calcium phosphate, mineral oil, cocoa butter,
and oil of theobroma.

The vaccine compositions can be packaged in forms convenient
for delivery. The compositions can be enclosed within a capsule, sachet, cachet,
gelatin, paper or other container. These delivery forms are preferred when
compatible with entry of the immunogenic composition into the recipient
organism and, particularly, when the immunogenic composition is being
delivered in unit dose form. The dosage units can be packaged, e.g., in tablets,
capsules, suppositories or cachets.

The vaccine compositions may be introduced into the subject to
be immunized by any conventional method including, e.g., by intravenous,
intradermal, intramuscular, intramammary, intraperitoneal, or subcutaneous
injection; by oral, sublingual, nasal, anal, vaginal, or transdermal delivery; or
by surgical implantation, e.g., embedded under the splenic capsule or in the
cornea. The treatment may consist of a single dose or a plurality of doses over
a period of time.

It will be appreciated that the vaccine of the invention may be
useful in the fields of human medicine and veterinary medicine. Thus, the
subject to be immunized may be a human or an animal, for example, cows,
sheep, pigs, horses, dogs and cats, and poultry such as chickens, turkeys, ducks
and geese.

WO 99/01473 PCT/GB98/01974

Other aspects and advantages of the present invention will be
understood upon consideration of the following illustrative examples. Example
1 addresses identification of S. aureus genes associated with virulence by
generating S. aureus mutants that contain a chromosomal insertion of a
signature-tagged transposon and identifying the mutants with reduced virulence.
Example 2 addresses confirmation of the attenuated virulence of individual
mutants. Example 3 addresses use of the virulence genes and gene products in
assays for screening potential agents for anti-bacterial activity.

Figure legends 

Fig. 1. A. Map of svr and plasmids for complementation. Sites for restriction
enzymes are indicated (A, AluI; E, EcoRI; P, PstI). B. Deduced amino acid
sequence of the Svr protein. C. Southern hybridization of chromosomal DNA.
DNA was extracted from S. aureus strains RN6390, Newman, Wood 46, ID401
and ID402 and digested with HindIII. The digested DNA was separated by
agarose gel electrophoresis, transferred onto nylon membrane and probed with
32P-labeled svr gene.

Fig. 2. Phenotypic analysis of P6C63. A. Production of α-, β- and δ-toxins. S.
aureus strains were streaked onto rabbit blood agar plates (for α-toxin), sheep
blood agar plates (for β-toxin) and horse blood agar plates (for δ-toxin) and cultured
overnight. Halos surrounding bacterial colonies are indicative of toxin activity.
B. Western immunoblot of protein A. Whole cell proteins were extracted from
each strain and equal amounts were separated by SDS-PAGE
followed by Western immunoblotting with anti-protein A monoclonal antibody.
C. Protein samples in (B) detected by PAGE and Coomassie Blue staining.

Fig. 3. In vivo survival kinetics of bacterial strains. Two groups of mice were
infected i.p. with 1x10^5 cfu of S. aureus wild-type strain RN6390 or svr
mutant P6C63. At 6, 24, 48, 72, 96 and 120 hours post-inoculation, two
mice/group were killed and the number of bacterial cfu/spleen was determined.
Data are the mean ± standard deviation at each time point.

Fig. 4. A. RT-PCR analysis of svr transcripts in strains RN6390, RN6911 and
P6C63. Total RNA was isolated from 10^9 cfu of post-exponential phase growth S.
aureus strains. RT-PCR was performed using two primers that amplify a 200
bp region within the svr gene. B and C. Northern blot analysis. Total RNA was
isolated from 10^9 cfu of post-exponential phase growth S. aureus strains
RN6390, RN6911, P6C63 and complemented strain P6C63(pID437). RNA was
separated by 1.5% agarose-0.66 M formaldehyde gel electrophoresis, transferred
to a nylon membrane and probed with 32P-labeled genes as indicated.

EXAMPLE 1 

Identification of S. aureus genes associated with virulence

S. aureus genes associated with virulence were identified by
signature-tagged mutagenesis as follows, generally according to International
Patent Publication No. WO 96/17951 and Hensel et al., Science, 269:400-403
(1995).

A. Construction of Plasmid pID408

A temperature-sensitive shuttle plasmid pID408 was constructed
for use in transferring the signature-tagged transposons into S. aureus. DNA
restriction digestions and ligations were performed as described by Sambrook et
al., Molecular cloning: a laboratory manual, 2nd ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY (1989). A 6.5 kb BamHI fragment
of pTV32ts [described in Youngman, in Plasmids: a practical approach, K.
Hardy, ed., Oxford: IRL Press, pages 79-103 (1985); obtained from J. Iandolo,
Department of Diagnostic Medicine, College of Veterinary Medicine,
Manhattan, Kansas] carrying the temperature-sensitive replicon pE194ts and a
chloramphenicol (cm) resistance gene (cm^R) was inserted into the BamHI site of
plasmid pSP72 (Promega), forming pID402. Plasmid pID402 was digested with
SmaI and self-ligated to remove a small fragment containing a BamHI site
between the SmaI sites, resulting in pID405. A DNA fragment of pBR322
(New England BioLabs) containing the ampicillin resistance gene (amp^R)/ori/rop
region (corresponding to bases 1904-4359 of pBR322) was prepared by
polymerase chain reaction (PCR) amplification using primers pBR1
(5'-GGAGCTCACTAGTCGGAGGCATCAGTGACC-3', SEQ ID NO: 95) and
pBR2 (5'-GGGATCCCATGAGAATTCTTGAAGACG-3', SEQ ID NO: 96).
Primer pBR1 contains sites for SacI and SpeI, and pBR2 contains sites for
BamHI and EcoRI. The PCR product was digested with BamHI and SacI and
ligated to the BamHI/SacI-digested fragment of pID405 (carrying replicon
pE194ts and cm^R) to create pID406, in which the amp^R/ori region of pSP72 is
replaced with the replicon of pBR322. Finally, a 6.5 kb BamHI-EcoRI fragment
from pTV9 [described in Youngman, in Plasmids: a practical approach, K.
Hardy, ed., Oxford: IRL Press, pages 79-103 (1985)], containing transposon
Tn917 and its erythromycin (erm) resistance gene (erm^R), was ligated with
BamHI- and EcoRI-digested pID406, resulting in plasmid pID408. The origin of
replication from pBR322 allows pID408 to replicate in E. coli, and the
temperature-sensitive replicon pE194ts (from pTV32ts) allows replication in S.
aureus at 32°C.
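
As an illustrative check only, the statement that primer pBR1 carries SacI and SpeI sites and primer pBR2 carries BamHI and EcoRI sites can be confirmed by a simple string search. The primer sequences are those given above with OCR spacing removed; the recognition sequences are the standard ones for these enzymes; the helper name sites_in is hypothetical.

```python
# Standard recognition sequences of the four enzymes named in the text.
RECOGNITION_SITES = {
    "SacI": "GAGCTC",
    "SpeI": "ACTAGT",
    "BamHI": "GGATCC",
    "EcoRI": "GAATTC",
}

# Primer sequences as given in the text (5' to 3').
PRIMERS = {
    "pBR1": "GGAGCTCACTAGTCGGAGGCATCAGTGACC",  # SEQ ID NO: 95
    "pBR2": "GGGATCCCATGAGAATTCTTGAAGACG",     # SEQ ID NO: 96
}


def sites_in(sequence):
    """Return the sorted names of enzymes whose site occurs in the sequence."""
    return sorted(name for name, site in RECOGNITION_SITES.items()
                  if site in sequence)


for primer_name, primer_seq in PRIMERS.items():
    print(primer_name, sites_in(primer_seq))
```

The search considers only the strand as written, which is sufficient here because all four recognition sequences are palindromic.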



B. Cloning and Selection of Signature Tags 

DNA signature tags were prepared as described by Hensel et al.,
Science, 269:400-403 (1995) and inserted into the transposon Tn917 carried by
pID408 as follows. Oligonucleotide tags in pool RT1
(5'-CTAGGTACCTACAACCTCAAGCTT-[NK]20-AAGCTTGGTTAGAATGGGTACCATG-3',
SEQ ID NO: 97, where N is A, C, G or T, and K is G or T) were synthesized
on an oligonucleotide synthesizer (Applied Biosystems). The 80 bp tags from
oligonucleotide pool RT1 were PCR-amplified using primers P10
(5'-CTAGAATTCTACAACCTCAAGCTT-3', SEQ ID NO: 98) and P11
(5'-AAGCTTGGTTAGAATGGAATTCATG-3', SEQ ID NO: 99). The PCR-amplified
tags were digested with EcoRI, gel-purified, and ligated with EcoRI-digested,
dephosphorylated pID408, to form plasmids containing uniquely
signature-tagged Tn917 transposons. The ligated DNA was transformed into S.
aureus strain RN4220 [a restriction defective strain derived from NCTC 8325-4
and described in Youngman, in Plasmids: a practical approach, K. Hardy, ed.,
Oxford: IRL Press, pages 79-103 (1985)] by electroporation. Electroporation
was performed by the method of Schenk and Laddaga, FEMS Microbiol. Lett.,
94:133-138 (1992), with the following modifications: electrotransformed
bacteria were plated on non-selective brain-heart infusion (BHI, Difco) agar
plates (1.5% agar), grown at 32°C for 8 to 12 hours and then replicated using an
Accutran Replica Plater (Schleicher and Schuell) onto BHI agar plates containing
20 µg/ml erm and 20 µg/ml cm, and grown at 32°C overnight.
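
To illustrate the degenerate [NK]20 design described above, the following sketch draws one random variable region and counts how many distinct variable regions the design allows. It is illustrative only: the function names are hypothetical, and uniform random base choice is an assumption that need not reflect actual oligonucleotide synthesis.

```python
import random

N_BASES = "ACGT"  # N is A, C, G or T
K_BASES = "GT"    # K is G or T


def random_variable_region(repeats=20, rng=None):
    """Draw one [NK]-repeat variable region, e.g. 40 nt for repeats=20."""
    rng = rng or random.Random()
    return "".join(rng.choice(N_BASES) + rng.choice(K_BASES)
                   for _ in range(repeats))


def possible_variable_regions(repeats=20):
    """Number of distinct variable regions: (4 choices x 2 choices) per NK unit."""
    return (len(N_BASES) * len(K_BASES)) ** repeats


tag = random_variable_region(rng=random.Random(0))  # seeded for repeatability
print(len(tag), tag)
print(possible_variable_regions())  # 8**20, roughly 1.15 x 10^18
```

With about 10^18 possible variable regions, two independently cloned tags are very unlikely to be identical, which is consistent with the screening for non-cross-hybridizing tags described below.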

A total of 400 erm^R, cm^R transformants were then screened to
identify 150 tags that amplified and labelled efficiently. The 150 transformants
were each cultured in 10 ml BHI broth containing 20 µg/ml erm and 20 µg/ml
cm at 32°C overnight. Plasmid DNA was extracted from each culture using a
Qiagen Plasmid Miniprep Kit according to the manufacturer's protocol except
that the bacterial cells were lysed by lysostaphin (Sigma; 200 mg/ml) at 37°C for
30 min. in the P1 solution of the Qiagen miniprep kit. An aliquot of each
plasmid DNA preparation (1 µg) was transferred onto a Hybond N+ membrane
(Amersham) by using a Bio-Dot Microfiltration Apparatus according to the
manufacturer's protocol (Bio-Rad) to give 75 different plasmids per membrane.
The membranes were then transferred to Whatman 3MM paper saturated with
denaturing solution (0.5 N NaOH, 1.5 M NaCl), incubated for 5 min., then
transferred to another piece of Whatman 3MM paper saturated with neutralizing
solution (1.5 M NaCl, 0.5 M Tris-Cl pH 7.4) for 5 min. After neutralization,
the membranes were dried at 80°C for 10 min and the DNA cross-linked by UV 
light using a Stratalinker (Stratagene). The signature tags from the 150
transformants were also labelled for use as probes. Aliquots of the plasmid
preparations were combined to form four pools of 37, 37, 38 and 38 plasmids.
The DNA signature tags from these four pools were PCR-amplified with
primers P12 (5'-GAATTCCATTCTAAC-3', SEQ ID NO: 100) and P13
(5'-ATTCCATTCTAACCAAGC-3', SEQ ID NO: 101) according to Hensel et al.,
Science, 269:400-403 (1995). These PCR products were gel-purified and subjected
to a digoxigenin (DIG)-labelling PCR using a PCR DIG Probe Synthesis Kit as
described by the manufacturer (Boehringer Mannheim). The DIG-labelled PCR
products were hybridized separately with the membranes according to the
method described in the DIG Probe Synthesis Kit, to identify tags that gave
strong hybridization signals, but which did not cross-hybridize with tags in the
other pools. From these experiments, a total of 96 plasmids were chosen on the
basis of labelling efficiency of tags and lack of cross-hybridization.

C. Generation of the S. aureus Mutant Bank 

The 96 plasmids containing uniquely signature-tagged transposons
were used to generate S. aureus mutants with a chromosomal transposon
insertion. A series of identical membranes for dot-blot hybridizations were
prepared by transferring 1 µg of each of the 96 plasmids onto Hybond N+
membranes using the Bio-Dot Microfiltration Apparatus. The 96 plasmids were
separately transformed into S. aureus strain RN6390 by electroporation [Schenk
and Laddaga, FEMS Microbiol. Lett., 94:133-138 (1992)]. Electrotransformed
bacteria were plated directly on BHI agar plates containing 20 µg/ml cm and 20
µg/ml erm at 32°C overnight. A single colony from each transformation was
transferred into each well of a microtitre dish containing 200 µl BHI broth, 20
µg/ml erm and 20 µg/ml cm. The microtitre dish was incubated at 32°C
overnight. Glycerol was added to each well to give a final concentration of 50%
and the plates were stored at -80°C. This collection of 96 transformants was
designated the master pool and was used for all subsequent mutagenesis.

To generate 96 different Tn917 mutants, bacteria from the master
pool were replicated using a microtitre dish replicator (Sigma) into the wells of a
new microtitre dish containing 200 µl BHI broth and 20 µg/ml erm. This dish
was incubated at 43°C overnight and then bacteria from each well were streaked
on BHI agar containing 20 µg/ml erm and incubated at 43°C overnight to obtain
single colonies. Approximately five different erm^R colonies obtained from each
well were transferred onto BHI agar containing 20 µg/ml cm and incubated at
32°C, and onto BHI agar containing 20 µg/ml erm and incubated at 43°C, to
screen for cm^S colonies. Erm^R, cm^S colonies should lack the plasmid and carry
a chromosomal insertion of Tn917. Individual erm^R, cm^S colonies from each of
the 96 wells were transferred into a new 96 well microtitre dish with BHI broth
containing 20 µg/ml erm and grown overnight at 43°C for use as inoculum.
These 96 mutants, taken together, form one "inoculum pool." Mutant pools
were also stored at -80°C in 50% glycerol.

To verify that erm^R, cm^S colonies lack the plasmid and carry a
single chromosomal insertion of Tn917, chromosomal DNA samples from
individual mutant strains and from a pool of 46 colonies that had been cultured
separately were digested with EcoRI and subjected to Southern analysis using a
fragment of the β-lactamase gene as a probe. For each of 12 separately
analyzed mutants, a single hybridizing fragment of different size was observed
in each lane. A large number of hybridizing fragments were observed in the
lane containing DNA from 46 mutants. These results indicate that the majority
of Tn917 insertions occur singly and at different locations in the S. aureus
chromosome.

D. Infection Studies to Identify Mutants with Reduced Virulence

The "inoculum pool" of 96 S. aureus mutants containing
chromosomal transposon insertions was evaluated for attenuated virulence in a
mouse model of bacteremia. After overnight incubation at 43°C, the 96
different mutants from individual wells of the microtitre dish were pooled
together and washed twice with BHI broth by centrifugation at 4000 x g for 10
min and resuspended in BHI broth. The OD620 was determined using a
spectrophotometer (OD620 of 1.6 x 10^9 colony forming units (CFU)/ml) [Cheung
et al., J. Bacteriol., 177:3220-3226 (1995)]. The bacterial suspension was
diluted to approximately 5x10^6 CFU/ml and then mixed with an equal volume of
pre-autoclaved 4% (w/v) Brewer's yeast (Sigma) in BHI broth. A total of 0.2
ml of this mixture, containing approximately 5x10^5 CFU bacteria, was injected
intraperitoneally into a CD-1 mouse (approximately 25 g in weight). The
number of CFU in the inoculum was verified by viable counts after plating a
diluted aliquot of the inoculum onto BHI agar. Two to four mice were inoculated
with each pool. Forty-eight hours after inoculation, bacteria were recovered from the
spleens of the animals as described by Hensel et al., Science, 269:400-403
(1995) and pooled to form the "recovered pool." Each recovered pool was
made from at least 10,000 bacterial colonies. The signature tags present in the
recovered pools were compared with the signature tags present in the inoculum
pools by PCR amplifying the tags using primers P12 and P13 (described in
section B above), DIG labelling the tags, and hybridizing the labelled tags to the
96 plasmids which had been transferred to Hybond N+ membranes using the
Bio-Dot Microfiltration Apparatus as described above in section C.
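
The inoculum arithmetic above can be checked with a short calculation. The function and parameter names are hypothetical; the numbers are those stated in the text (a suspension of approximately 5x10^6 CFU/ml, mixed with an equal volume of yeast suspension, 0.2 ml injected).

```python
def cfu_per_injection(suspension_cfu_per_ml, injected_volume_ml,
                      dilution_factor=2.0):
    """CFU delivered per animal after mixing with an equal volume of diluent.

    dilution_factor=2.0 reflects the 1:1 mix with the yeast suspension.
    """
    mixed_cfu_per_ml = suspension_cfu_per_ml / dilution_factor
    return mixed_cfu_per_ml * injected_volume_ml


dose = cfu_per_injection(5e6, 0.2)
print(f"{dose:.1e} CFU per mouse")                 # about 5 x 10^5 CFU
print(f"{dose / 96:.0f} CFU per mutant on average")  # pool of 96 mutants
```

On these figures each of the 96 mutants is represented by roughly 5,000 CFU in the inoculum, assuming equal representation of the mutants in the pool (an assumption the text does not state).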


E. Virulence Gene Identification and DNA Sequencing 

A total of 13 pools, each comprising 96 S. aureus Tn917 mutants,
were screened as described above for loss of virulence in mice. From these, 50
mutants were identified whose tags hybridized strongly to probes from the
inoculum pools but weakly to probes from the corresponding recovery pools.
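
The selection criterion above (strong signal with the inoculum probe, weak signal with the recovered probe) can be sketched as a simple filter. This is an illustration under stated assumptions: the text does not give numerical thresholds, so the normalized signal values, the cut-offs strong_cutoff and weak_ratio, and the tag names are all hypothetical.

```python
def attenuated_tags(inoculum_signal, recovered_signal,
                    strong_cutoff=0.5, weak_ratio=0.2):
    """Return tags that label well in the inoculum but are depleted on recovery.

    inoculum_signal, recovered_signal -- dicts of tag -> normalized dot-blot
        intensity (hypothetical units, 0 to 1)
    strong_cutoff -- minimum inoculum intensity for a tag to be scored (assumed)
    weak_ratio    -- maximum recovered/inoculum ratio counted as "weak" (assumed)
    """
    hits = []
    for tag, inoculum in inoculum_signal.items():
        recovered = recovered_signal.get(tag, 0.0)
        if inoculum >= strong_cutoff and recovered <= weak_ratio * inoculum:
            hits.append(tag)
    return sorted(hits)


# Hypothetical intensities for three tags in one pool.
inoculum = {"tagA": 1.00, "tagB": 0.90, "tagC": 0.10}
recovered = {"tagA": 0.95, "tagB": 0.05, "tagC": 0.00}
print(attenuated_tags(inoculum, recovered))  # ['tagB']
```

A tag that labels poorly in the inoculum (tagC in this example) is not scored, since its absence from the recovered pool would be uninformative.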

To clone the chromosomal DNA flanking the transposon insertion
points of these 50 mutants, 3-5 µg of S. aureus chromosomal DNA from each
mutant was isolated as described by Pospiech and Neumann, Trends Genet.,
11:217-218 (1995) and completely digested with HindIII. Half of the digested
DNA was then subjected to Southern hybridization analysis using the pBR322
fragment of pID408 as a probe to determine the size of the DNA fragment
carrying this part of the transposon. The rest of the DNA was resuspended in
200 µl of ligation buffer (Gibco-BRL) and self-ligated overnight at 16°C. The
ligated products were transformed into E. coli DH5α (Gibco) and plated onto
Luria Bertani (LB) agar containing 50 µg/ml ampicillin, and incubated at 37°C
overnight. A single amp^R colony from each transformation was grown up in LB
broth containing 100 µg/ml amp. Plasmid DNA was extracted using a Qiagen
plasmid miniprep kit. Chromosomal DNA flanking the transposon was obtained
using primer pseq-1 (5'-TGAACTGCCACTGTAGAGAGA-3', SEQ ID NO:
102) based on the erm-proximal end of Tn917, and sequenced (for a stretch of
several hundred nucleotides) using a Model 373A Sequencing System (Applied
Biosystems). The sequence obtained is indicated in Table 1 below with
reference to the corresponding nucleotides of the SEQ ID NO.

These DNA sequences were analyzed by searching the S. aureus
database from Human Genome Sciences and the European Molecular Biology
Laboratory and Genbank DNA and Protein Databases using the BLAST and
FASTA network service at the Human Genome Mapping Project Resource
Centre, Hinxton, UK. Results of the searches done to date in identifying the
virulence genes and their possible function are shown in Table 1. Table 1 below
displays the signature tag identification number, the SEQ ID NO: corresponding
to each virulence gene (and the nucleotide positions within the SEQ ID NO: that
correspond to the actual stretch of DNA that was sequenced), the possible
function of the gene, and the LD50 of the knockout mutant as determined in
Example 2 below.



EXAMPLE 2

Assessment of virulence of S. aureus knockout mutants

The virulence of individual knockout mutants, identified in
Example 1 as having reduced virulence when tested in pools of 96 mutants, was
assessed by carrying out an LD50 determination for each mutant. Confirmation
that a given mutation is in a virulence gene may be obtained by comparing the
parent and mutated bacterial strains in terms of whether both strains are equally
effective in establishing an infection with the same consequences for the host
animal. In practice, this may be done by comparing LD50 values (the number of
bacteria required to produce a 50% mortality rate in the animals under
standardized conditions) between the wild-type strain and the mutant derivative
strain. If the LD50 values are within standard error of each other then the
mutation is not in or does not affect a virulence gene. If significant differences
occur between the wild-type and the mutant strain, where the mutant strain is
significantly less able to cause lethal infection, then the mutation is in a
virulence gene or dramatically affects a virulence gene.

Briefly, LD50 determinations were conducted as follows. CF1
female mice weighing between 19 and 22 grams were injected intraperitoneally
with a set concentration of either wild-type or mutant S. aureus in 0.2 ml of BHI
broth containing 4% (w/v) dried brewer's yeast as an adjuvant for establishment
of infection. The amount of brewer's yeast may vary from 0-8% dependent
upon the bacteria being utilized in the infection. A single LD50 determination
utilizes five log-dilutions of bacteria (10^3 to 10^7), with each log-dilution being
tested in 10 mice. The actual number of bacteria employed was estimated in
each LD50 determination by a plate count conducted on the bacterial stock to
determine the number of CFU of bacteria. Following infection, the mice were
monitored daily for mortality for a period of at least one week. At the end of
the observation period the LD50 was determined using probit analysis on the
mortality data. The LD50 value of the wild-type S. aureus strain RN6390 ranges
from 1.4 x 10^4 to 1.4 x 10^5. The LD50 values for the virulence gene mutants are
shown in Table 1 below.
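
By way of illustration only, a probit estimate of the LD50 from dose-mortality counts may be sketched as follows. The text states only that probit analysis was used; the fitting procedure shown here (iteratively reweighted least squares, also called Fisher scoring), the function name, and the mortality counts in the example are assumptions and do not reproduce data from the Examples.

```python
from statistics import NormalDist


def probit_log10_ld50(log10_doses, group_sizes, deaths, iterations=50):
    """Fit P(death) = Phi(a + b * log10(dose)); return log10(LD50) = -a / b."""
    std_normal = NormalDist()
    a, b = 0.0, 0.0  # start from a flat 50% mortality curve
    for _ in range(iterations):
        sw = swx = swxx = swz = swxz = 0.0
        for x, n, r in zip(log10_doses, group_sizes, deaths):
            eta = a + b * x
            # Clamp to avoid division by zero at extreme fitted probabilities.
            p = min(max(std_normal.cdf(eta), 1e-9), 1.0 - 1e-9)
            density = max(std_normal.pdf(eta), 1e-12)
            weight = n * density ** 2 / (p * (1.0 - p))
            z = eta + (r / n - p) / density  # working response
            sw += weight
            swx += weight * x
            swxx += weight * x * x
            swz += weight * z
            swxz += weight * x * z
        # Weighted least-squares regression of z on x gives the updated a and b.
        b = (sw * swxz - swx * swz) / (sw * swxx - swx ** 2)
        a = (swz - b * swx) / sw
    return -a / b


# Hypothetical counts: five log-dilutions (10^3 to 10^7 CFU), 10 mice each.
log10_ld50 = probit_log10_ld50([3, 4, 5, 6, 7], [10] * 5, [1, 3, 5, 7, 9])
print(f"LD50 is about {10 ** log10_ld50:.1e} CFU")
```

The sketch assumes mortality that is neither all-or-none across the dose range nor unrelated to dose; with such data the iteration may fail, and an established statistics package would be the more robust choice in practice.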
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TABLE 1 - VIRULENCE GENE SEQUENCES AND POSSIBLE FUNCTION 



SIGNA- 
TURE 
TAG ID 


SEQID 

NO: 

(DNA; 

amino 

acid) 


Position 
of DNA 
sequence 
obtained 
in 

Example 1 


IDENTIFICATION OF POSSIBLE 
FUNCTION OF GENE 


LD 50 of 
knockout 
mutant 
(#CFU)* 


p2c73 


SEQID 

NO: 1; 
2 


517-756 


FemA/B like with '40% identity 


>6.8 x 
in 8 


p2c90 


SEQID 
NO: 3 


281-431 


Unknown; near Opp operon 




p9b74 


SEQ ID 
NO: 4; 
5 


593-711 


Tryptophan synthase alpha chain 
TrpA 




pllc29 


SEQID 

NO: 6; 
7 


1107- 
1260 


FemB 




pl3c83 


SEQID 
NO: 6; 
7 


1107- 
1260 


FemB 


3.9 x 10 s 


p4cl5 


SEQ ID 

NO: 8; 
9 & 10 


360-654 


MarR/LuxR-like regulatory protein 




p6c63 


SEQID 
NO: 8; 
9& 10 


388-659 


MarR/LuxR-like regulatory protein 


>5.4 x 
10 8 


p5c4 


SEQID 
NO: 11; 
12 


192-481 


41% identity to nitrate reductase 
NirQ 




p9b66 


SEQID 
NO: 13; 
14 


221-474 


Peptide methionine sulfoxide 
reductase 


4.0 x 10 4 
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SIGNA- 
TURE 
TAG ID 


SEQ ID 

NO: 

(DNA; 

amino 

acid) 


Position 
of DNA 
sequence 
obtained 
in 

Example 1 


IDENTIFICATION OF POSSIBLE 
FUNCTION OF GENE 


LDjoof 
knockout 
mutant 
(#CFU)* 


pl0cl5 


SEQID 
NO: 17 
& 19; 
18&20 


562-825 


Oligopeptide transporter OppD 


>2.6 x 
10 8 


pl3b74 


SEQID 
NO: 21 


all 


Possible stem-loop termination 
sequence; no obvious ORF 


8.8 x 10 4 


pl3c72 


SEQID 
NO: 21 


all 


Possible stem-loop termination 
sequence; no obvious ORF 




pl4cl5 


SEQID 

NO: 22; 
23&24 


669-1009 


C-term of aspartokinase 2 alpha 
subunit and N-term of aspartate 
semialdeyde dehydrogenase Asd; 
both involved in homoserine 
synthesis 




pl3b26 


SEQ ID 

NO: 25; 
26 


442-819 


homoserine kinase 


7.7 x 10 4 


p7cl8 


SEQ ID 

NO: 27; 
28 


667-847 


diaminopimelate decarboxylase 
LysA; lysine synthesis 




pl5c31 


SEQ ID 

NO: 29; 
30 


1594- 
2018 


phosphoribosylformylglyinamide 
decarboxylase PurL; purine 
synthesis 




pl0bl8 


SEQ ID 

NO: 31; 
32 


3-404 


tryptophan synthase alpha chain 
trpA 


>5.2 x 
10 8 


P 6bl8 


SEQID 

NO: 31; 
32 


3-404 


tryptophan synthase alpha chain 
trpA 
SIGNATURE TAG ID | SEQ ID NO: (DNA; amino acid) | Position of DNA sequence obtained in Example 1 | IDENTIFICATION OF POSSIBLE FUNCTION OF GENE | LD50 of knockout mutant (#CFU)*
p10b66 | SEQ ID NO: 33; 34 | 30-282 | tryptophan synthase beta chain trpB | >4.2 x 10^6
p10c34 | SEQ ID NO: 35; 36 | 609-817 | Anthranilate phosphoribosyl transferase TrpD; tryptophan synthesis | >4 x 10^6
p4c27 | SEQ ID NO: 37; 38 | 1130-1254 | dihydrolipoamide succinyl transferase component (e2) of 2-oxoglutarate dehydrogenase complex in TCA cycle |
p4c52 | SEQ ID NO: 39; 40 | 498-738 | dihydrolipoamide succinyl transferase component (e2) of 2-oxoglutarate dehydrogenase complex in TCA cycle |
p10b2 | SEQ ID NO: 41; 42 | 880-1159 | dihydrolipoamide succinyl transferase component (e2) of 2-oxoglutarate dehydrogenase complex in TCA cycle | >2.6 x 10^8
p10c20 | SEQ ID NO: 43 | all | Sequence 3' to the dihydrolipoamide succinyl transferase component (e2) of 2-oxoglutarate dehydrogenase complex in TCA cycle; no obvious ORF | 3.7 x 10^5
p12c32 | SEQ ID NO: 44; 45 | 188-438 | Alpha-ketoglutarate dehydrogenase (e1) of the 2-oxoglutarate dehydrogenase complex in TCA cycle; acetyl CoA synthase | >1.0 x 10^6
p10b30 | SEQ ID NO: 46; 47 | 2419-2574 | Heat shock protein ClpB | 1.4 x 10^4
p13c3 | SEQ ID NO: 48; 49 | 454-765 | 35% identity to hypothetical 45.9 kDa protein from B. subtilis ImpB/MucB/SamB family | 1.8 x 10^4
p4b3 | SEQ ID NO: 50; 51 | 790-1203 | Unknown; homology to hypothetical proteins from B. subtilis YAAD and YAAE |
p4c63 | SEQ ID NO: 52; 53 | 576-806 | Unknown; metallopeptidase motif |
p5c3 | SEQ ID NO: 54; 55 | 1-450 | Unknown; membrane transporter motif | 7.1 x 10^4
p8d26 | SEQ ID NO: 56 & 58; 57 & 59 | 32-262 | Unknown; 38% identity to C. elegans ORF |
p9b65 | SEQ ID NO: 60; 61 | 1-361 | sodium/proton dependent alanine carrier protein | >9.2 x 10^8
p10b32 | SEQ ID NO: 62; 63 | 205-296 | Unknown |
p10b85 | SEQ ID NO: 64 | 38-212 | Unknown |
p10b89 | SEQ ID NO: 65; 66 | 1-205 | Unknown; prokaryotic membrane lipoprotein lipid attachment motif | >2.2 x 10^8
p10c30 | SEQ ID NO: 67; 68 | 730-1041 | Unknown; sensor-type protein motif | 8.8 x 10^5
p10c52 | SEQ ID NO: 69; 70 | 1-262 | Unknown |
p10d9 | SEQ ID NO: 71; 72 | 422-639 | Unknown; enterotoxin-like motif |
p11c12 | SEQ ID NO: 73; 74 | 1-150 | Unknown; 40% identity to Mycoplasma mycoides hypothetical protein in ffh 5' region q01444 | >4.2 x 10^6
p11c66 | SEQ ID NO: 75 & 77; 76 & 78 | 292-529 | Unknown; near Opp operon |
p5c34 | SEQ ID NO: 79; 80 | 1180-1236 | glycine betaine transporter; region 55% identity to B. subtilis p54417 |
p10c18 | SEQ ID NO: 81 | 1-477 | Unknown; potential ORF in 120 bp |
p12c3 | SEQ ID NO: 82; 83 | 1-318 | acetyl-CoA synthetase; region 59% identity to B. subtilis p39062 |
p14b25 | SEQ ID NO: 84; 85 | 1-148 | Unknown; exonuclease-like; 25% identity to E. coli p13458; contains ATP binding motif |
p14b74 | SEQ ID NO: 86; 87 | 243-624 | Unknown; 51% identity to hypothetical 45.9 kd protein (YQJW) from B. subtilis p54560 |
p14c13 | SEQ ID NO: 88 | 1-441 | Unknown; near Opp operon |
p15b9 | SEQ ID NO: 89; 90 | 654-867 | Unknown; 41% identity to hypothetical 33.7 kd protein (YHCT) from B. subtilis p54604 |
p15b32 | SEQ ID NO: 91; 92 | 286-370 | Unknown; similar to orf1 5' of acvB of Agro. tumefaciens a36922 |
p15c4 | SEQ ID NO: 93; 94 | 25-192 | 4-oxalocrotonate tautomerase; 42% identity to Pseudo. putida a43397 |

*The LD50 for wild-type S. aureus RN6390 is normally 1.4 x 10^4 to 1.4 x 10^5.



EXAMPLE 3 

Use of virulence gene products in screen for anti-bacterial agents 
The virulence genes and their gene products are utilized in assays
for identifying new anti-bacterial agents against S. aureus. The genes are
cloned, the proteins encoded by the genes are produced and purified, high
throughput assays are established to screen for inhibitors, and the inhibitors
identified in the primary screen are evaluated in secondary assays. The cloning
and expression of FemA, and its use in screening potential inhibitory agents, are
described below.



A. Cloning of FemA Gene

DNA restriction digestions and ligations are performed as
described by Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed.,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989).
Cloning of the S. aureus femA gene [Berger-Bachi et al., Mol. Gen. Genet.,
219:263-269 (1989)] for intracellular expression in E. coli is carried out as
follows. To clone the gene with six histidine residues at the 3'-terminus, the
following two oligonucleotide primers,

5'-GGCCATCGATAATGAAATTAATTAACGAGAGACAAATAGG-3', SEQ ID NO: 103, and

5'-GGCCGGATCCCTAGTGATGGTGATGGTGATGAAAAATTCTGTCTTTAACTTTTTT-3', SEQ ID NO: 104,

are used to PCR amplify the gene from S. aureus genomic DNA. The resulting
femA gene has a ClaI site at the 5'-end and a BamHI site at the 3'-end for
cloning into E. coli expression vectors. For secretion of the femA gene with
six histidine residues at the 3'-terminus into the periplasm of E. coli, PCR
amplification using the following two oligonucleotide primers,

5'-GGCCGGTACCAAATTAATTAACGAGAGACAAATAGG-3', SEQ ID NO: 105, and

5'-GGCCGGATCCCTAGTGATGGTGATGGTGATGAAAAATTCTGTCTTTAACTTTTTT-3', SEQ ID NO: 106,

results in a fragment with a KpnI site at the 5'-end and a BamHI site at the
3'-end. The approximately 1.3 kb PCR fragment may be sequenced directly, or
after cloning into an expression vector, to confirm the sequence of the femA
gene and its flanking regions.
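
The primer design described above can be checked with a short script. The following Python sketch is an illustration only and is not part of the original disclosure: it scans the three distinct primer sequences for the restriction sites named in the text and reverse-complements the shared reverse primer to expose the coding-strand codons that precede the stop codon. The sequences are transcribed from SEQ ID NOs: 103 to 106; all function and variable names are hypothetical.

```python
# Recognition sequences of the enzymes named in the text (all palindromic,
# so a site reads the same on either strand).
SITES = {"ClaI": "ATCGAT", "BamHI": "GGATCC", "KpnI": "GGTACC"}

# Primer sequences as transcribed from the text (5' to 3').
FWD_INTRACELLULAR = "GGCCATCGATAATGAAATTAATTAACGAGAGACAAATAGG"  # SEQ ID NO: 103
FWD_PERIPLASMIC = "GGCCGGTACCAAATTAATTAACGAGAGACAAATAGG"        # SEQ ID NO: 105
# SEQ ID NOs: 104 and 106 are the same reverse primer.
REV_HIS6 = "GGCCGGATCCCTAGTGATGGTGATGGTGATGAAAAATTCTGTCTTTAACTTTTTT"


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def sites_in(primer):
    """List the named restriction sites present in a primer."""
    return [name for name, site in SITES.items() if site in primer]


def codons_before_stop(coding_strand, stop="TAG", n_codons=6):
    """Return the n_codons codons immediately 5' of the last stop codon."""
    stop_at = coding_strand.rfind(stop)
    start = stop_at - 3 * n_codons
    if stop_at < 0 or start < 0:
        raise ValueError("stop codon or tag region not found")
    tag = coding_strand[start:stop_at]
    return [tag[i:i + 3] for i in range(0, len(tag), 3)]


if __name__ == "__main__":
    for label, primer in [("SEQ ID NO: 103", FWD_INTRACELLULAR),
                          ("SEQ ID NO: 105", FWD_PERIPLASMIC),
                          ("SEQ ID NOs: 104/106", REV_HIS6)]:
        print(label, sites_in(primer))
    # CAT and CAC both encode histidine.
    print(codons_before_stop(reverse_complement(REV_HIS6)))
```

Reading the reverse primer on the coding strand is the design choice that matters here: the histidine codons and the stop codon only become visible after reverse complementation.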



B. Recombinant FemA Protein Production and Purification

Expression of femA in E. coli is carried out as follows. The E.
coli expression vector pSRtac-pelB contains the synthetic tac promoter and the
signal sequence of PelB [Lei et al., J. Bact., 169:4379-4383 (1987)], with
transcription terminators upstream and downstream to minimize transcription
read through. For intracellular expression, the femA gene generated by PCR as
described above in section A with ClaI and BamHI sites at the termini is cloned
into pSRtac-pelB digested with ClaI and BamHI (replacing the PelB signal
sequence). For secretion into the periplasm, the femA gene generated by PCR as
described above in section A with KpnI and BamHI sites at the termini is cloned
into pSRtac-pelB digested with KpnI and BamHI, directly downstream from the
pelB signal sequence. E. coli are transformed with the expression vector
containing the femA gene. To induce the tac promoter and optimize expression
level, the transformed bacteria are incubated with IPTG (isopropylthio-β-
galactoside) at concentrations ranging from 2 x 10^-3 M to 1 x 10^-5 M, at cell
growth temperatures between 27°C and 42°C in rich medium or minimal medium.

The recombinant FemA thus produced contains six histidine
residues at the C-terminus and can be purified via the affinity of histidine for
nickel. Chromatography may be carried out with the metal chelating resin Ni-
nitrilotriacetic acid (Ni-NTA) (available from Qiagen, Chatsworth, CA) under
native or denaturing conditions.

C. High Throughput Assays Using FemA for Screening Potential Inhibitors 

The screening method described in U.S. Patent No. 5,585,277 is
used as follows to establish a high throughput assay to identify ligands that bind
to FemA. The test ligand and FemA are combined under conditions (e.g.,
temperature, pH, salt concentration, time) appropriate for detecting binding of
FemA to a ligand. If too little target protein is unfolded, the observed signal will
occur at too low a level or rate to be conveniently measured. The conditions are
optimized using known methods. Binding of a test ligand to FemA is assessed in
one of several ways: by determining the extent to which folded FemA is present
in the test ligand-FemA combination, by determining the extent to which unfolded
FemA is present in the test ligand-FemA combination, or by determining the ratio
of folded FemA to unfolded FemA in the combination. There are numerous
methods to carry out these determinations. For example, proteolysis may be
used. A protease which acts preferentially on unfolded FemA is combined with
the test ligand-FemA combination, and after an appropriate period of incubation,
the difference between intact or degraded FemA in the presence and in the
absence of the test ligand is determined. The addition of a test ligand which
binds the folded FemA, thus stabilizing it in the protease-resistant form, changes
the rate of proteolysis. A wide variety of known proteases, such as trypsin,
chymotrypsin, V8 protease, elastase, carboxypeptidase, proteinase K,
thermolysin and subtilisin, can be used. As another example, the binding of the
test ligand to FemA is assessed through the use of antibodies that specifically
bind to the protein only in the unfolded state. There are numerous methods
known in the art for producing antibodies to a particular protein.
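
The proteolysis read-out described above amounts to comparing how much FemA stays intact with and without the test ligand. The Python sketch below shows one way such a comparison might be scored. It is not the method of the cited patent: the `ProteolysisReadout` type, the `protection` score and the 0.2 cut-off are all assumptions introduced for illustration, and the numbers in the usage example are placeholders rather than measurements.

```python
from dataclasses import dataclass


@dataclass
class ProteolysisReadout:
    """Fraction of FemA left intact (0.0 to 1.0) after protease digestion."""
    intact_without_ligand: float
    intact_with_ligand: float


def protection(readout):
    """Gain in intact FemA attributable to the test ligand.

    A ligand that binds and stabilizes the folded, protease-resistant
    form should give a positive value.
    """
    return readout.intact_with_ligand - readout.intact_without_ligand


def is_candidate_binder(readout, min_protection=0.2):
    """Flag a ligand whose protection exceeds a (hypothetical) cut-off."""
    return protection(readout) >= min_protection


if __name__ == "__main__":
    # Placeholder values, for illustration only.
    example = ProteolysisReadout(intact_without_ligand=0.30,
                                 intact_with_ligand=0.75)
    print(protection(example), is_candidate_binder(example))
```

In practice the cut-off would be set from the assay's own control wells; the fixed default here only keeps the sketch self-contained.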



D. In Vivo Assays to Evaluate Inhibitors

The inhibitors/binders identified by the initial screens such as the
one described in section C above are evaluated for their effect on virulence in
mouse models of S. aureus infection. Mice are infected with 100 LD50s of the
wild-type bacteria and treated with varying doses of the test inhibitory agent.
The ED50 of the test inhibitory agent (the amount in milligrams of drug per
kilogram of body weight required to cure 50% of infected animals) is
determined. Each trial contains two control groups: a negative control group
given an LD50 dose of bacteria and no test inhibitory agent (which ensures that
the infecting dose is operationally close to 100 LD50s), and a positive control
group given a 100 LD50 dose of bacteria and an ED50 dose of a known and
effective antibiotic for that bacterial infection (which confirms that the infection
can be cured under appropriate conditions).

Five dosage levels of test inhibitory agent, utilizing serial two-fold
dilutions, are employed. An exemplary dosage scheme begins with a
minimum of 6.25 mg of drug per kg of body weight per dose, and ranges up to
a maximum dose of 100 mg/kg. Each dosage level is tested in six mice and
deaths following infection and treatment are monitored for at least 6 days. At
the end of the test period, probit analysis is employed to determine the ED50
value of the inhibitor, i.e. the amount of drug in milligrams per kilogram
of body weight required to reverse virulence so as to result in only 50%
mortality of infected animals in the test. In tests designed to detect the
prevention of infection by interference with virulence factors, the first dose of
the inhibitor is given one hour prior to infection, and then four and eight hours
post infection. In tests designed to detect virulence inhibitors which reverse the
outcome of infection, the inhibitor is administered one and four hours post
infection on the first day of infection and once in the morning and afternoon
(separated by 6-8 hours) on the second and third days following infection.
Inhibitory agents that successfully prevent the establishment of an infection or
reverse the outcome of an infection once established are identified.
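
The dosage scheme and the ED50 estimate described above can be sketched in Python as follows. The dose series follows directly from the text (five two-fold steps from 100 mg/kg down to 6.25 mg/kg). The ED50 routine is a deliberately simplified stand-in for probit analysis: it fits an unweighted straight line to probit-transformed survival fractions against log10(dose), whereas a full probit analysis would use weighted maximum-likelihood fitting. The function names are hypothetical, and the survival fractions in the usage example are generated from a known curve purely to show that the routine recovers it; they are not experimental data.

```python
import math
from statistics import NormalDist


def twofold_dose_series(max_dose_mg_per_kg=100.0, levels=5):
    """Serial two-fold dilutions, highest dose first."""
    return [max_dose_mg_per_kg / 2 ** i for i in range(levels)]


def probit_ed50(doses, fraction_surviving):
    """Estimate the ED50 from survival fractions at each dose.

    Simplified probit fit: probit(p) = intercept + slope * log10(dose),
    solved by ordinary least squares. Groups with 0% or 100% survival
    have no finite probit and are skipped.
    """
    normal = NormalDist()
    points = [(math.log10(dose), normal.inv_cdf(p))
              for dose, p in zip(doses, fraction_surviving)
              if 0.0 < p < 1.0]
    if len(points) < 2:
        raise ValueError("need at least two groups with partial survival")
    mean_x = sum(x for x, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0.0:
        raise ValueError("doses must differ")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in points) / sxx
    if slope == 0.0:
        raise ValueError("no dose response")
    intercept = mean_y - slope * mean_x
    # probit(0.5) = 0, so the ED50 is where the fitted line crosses zero.
    return 10 ** (-intercept / slope)


if __name__ == "__main__":
    doses = twofold_dose_series()
    print(doses)
    # Synthetic responses from a known curve (ED50 = 25 mg/kg, slope = 2).
    synthetic = [NormalDist().cdf(2.0 * (math.log10(d) - math.log10(25.0)))
                 for d in doses]
    print(probit_ed50(doses, synthetic))
```

With six mice per group, observed fractions are coarse (multiples of 1/6), which is one reason a weighted fit is normally preferred over this unweighted sketch.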

Example 4: Svr, a virulence regulator of Staphylococcus aureus

The identification of a large number of putative virulence genes in Staphylococcus
aureus using signature-tagged mutagenesis is reported in Examples 1 to 3.
Many of these had no significant similarity to sequences in the DNA and protein
databases. To investigate further the role of these genes in virulence, mutant
strains were subjected to a series of phenotypic tests. One mutant strain was
found with pleiotropic effects on virulence factors. We thus designated the
mutated gene in this strain staphylococcal virulence regulator (svr). Compared
with its parent wild-type strain, the svr mutant strain expressed greatly reduced
amounts of α-, β- and δ-toxins but increased amounts of protein A. DNA
sequence analysis of the cloned svr gene did not reveal any significant
similarities to entries in the DNA and protein databases. Northern hybridization
with probes specific for hla, the gene encoding α-toxin, and spa, the gene
encoding protein A, showed that svr affected the expression of α-toxin and
protein A at the mRNA level. Northern analysis of svr, agr and RNAIII
transcripts showed that agr mRNA and RNAIII were detectable in the wild-type
strain and in an svr mutant strain carrying the svr gene on a plasmid but were
absent in the agr- and svr- strains. svr transcripts were present in wild-type
and agr- strains but were not detected in the svr- strain. This indicates that svr
is required for the expression of agr and RNAIII. The svr product is therefore
likely to be a novel component in the agr regulatory network controlling
virulence of S. aureus.

METHODS 

Bacterial strains and plasmids 

Bacterial strains and plasmids are listed in Table 1. S. aureus strains were
grown in Brain Heart Infusion (BHI) medium (Difco) with or without agar
(1.5%) and antibiotics (erythromycin [erm] at 20 µg/ml and/or chloramphenicol
[cm] at 20 µg/ml). E. coli strain DH5α was grown in Luria Bertani (LB)
medium with or without ampicillin [amp] at 50 µg/ml.



Table 1. Bacterial strains and plasmids used in this study

Strain or plasmid | Phenotype or characteristics | Source or reference

Strains
E. coli DH5α | F- supE44 ΔlacU169 (φ80 lacZ ΔM15) hsdR17 recA1 gyrA96 thi-1 | Gibco BRL

S. aureus
RN4220 | NCTC8325-4, restriction minus | Peng et al. 1988
RN6390 | Propagated laboratory strain | Peng et al. 1988
RN6911 | RN6390 Δagr | Peng et al. 1988
P6C63 | svr strain, selected by STM | Examples 1 to 3
Newman | Wild-type strain | NCTC 10833
Wood46 | An α-hemolysin-producing strain | NCTC 10345
ID401 | Clinical isolate from Hammersmith Hospital, Mec^r | This study
ID402 | Clinical isolate from Hammersmith Hospital, Mec^r | This study

Plasmids
pBR322 | Cloning vector, Amp^r, Tc^r | New England BioLab
pSP72 | Cloning vector, Amp^r | Gibco BRL
pVA380-1 | Cloning vector of S. aureus, Tc^r | Macrina et al. 1980
pCW59 | Cloning vector, Tc^r | Wilson et al. 1981
pID431 | Screened from pBR322 S. aureus library. Contains svr, orf2 and orf3 | This study
pID413 | Cloning vector, derived from pVA380-1, Tc^r | This study
pID437 | pID413 carrying svr | This study
pID439 | pID413 carrying svr, orf2 and orf3 | This study
pID4311 | pID413 carrying svr and orf2 plus 287 bp upstream sequence of orf2 | This study
pID4312 | pID413 carrying svr, orf2 and orf3 plus 287 bp upstream sequence of orf2 | This study


DNA manipulations, PCR, digoxigenin labelling and hybridizations
Chromosomal DNA from Staphylococcus aureus was isolated as described by
Pospiech et al. (8). DNA restriction digestions and modifications were
performed as described by Sambrook et al. (9). Plasmid DNA from S. aureus
strain RN4220 was isolated using a Qiagen Plasmid Miniprep Kit according to
the manufacturer's protocol except that the bacterial cells were lysed by
lysostaphin (Sigma; 200 µg/ml) at 37°C for 30 min prior to plasmid
purification.

Construction of plasmids

Plasmid pID413 was derived from pVA380-1 (10). A 2.5 kb fragment carrying
the pVA380-1 replicon was PCR-amplified from pVA380-1. Restriction sites for
BglII and HindIII were introduced in the 5' end of the amplified fragment by
using forward primer 5'-TGGAGATCTAAGCTTTGCATAACTTTCTCGTCC-3'
(SEQ ID No 107) and reverse primer 5'-TCCTGGCGATTCTGAGAC-3'
(SEQ ID No 108). The amplified fragment was filled in with DNA polymerase
Klenow fragment and ligated with a 2.3 kb DNA fragment carrying the
tetracycline resistance gene from pCW59 after digestion by HindIII and filling in
with DNA polymerase Klenow fragment, resulting in plasmid pID413. A DNA
polylinker of plasmid pSP72 was digested with BglII and HindIII and inserted
into BglII and HindIII digested pID413 to generate pID413PL.

Construction of genomic library of S. aureus

An S. aureus (RN6390) chromosomal DNA library was constructed in pBR322 as
follows: chromosomal DNA was partially digested with BamHI and EcoRI to
an average size of 5 kb and purified by phenol:chloroform extraction. The
purified DNA fragments were ligated with BamHI and EcoRI digested pBR322.
This ligation product was transformed into E. coli DH5α by electroporation and
plated on LB agar plates containing 50 µg/ml ampicillin.

Complementation of svr mutant P6C63 

Four DNA fragments with different sizes were cloned into pID413PL to
complement mutant strain P6C63 (Fig. 1A). All four fragments were
amplified by PCR using primers based on the DNA sequence flanking the svr
gene. Restriction sites for BamHI and HindIII were introduced in the 5' end and
the 3' end of each fragment, respectively. The amplified fragments were
digested by BamHI and HindIII and inserted into BamHI and HindIII digested
pID413PL to generate pID437, pID439, pID4311 and pID4312. The svr gene
was amplified from genomic DNA of RN6390 by PCR using primers
5'-TGGGGATCCGATAAGTGTGACTGGTAG-3' (SEQ ID No. 109) and
5'-TGGAAGCTTACATTACTTCAAATAAATTA-3' (SEQ ID No. 110) to
generate pID437. A 2.1 kb fragment containing svr, orf2 and orf3 was amplified
by PCR using primers 5'-TGGGGATCCTGCATATCAAAATGTTTATGGC-3'
(SEQ ID No. 111) and 5'-TGGAAGCTTACACATATGCCAATCTCAC-3'
(SEQ ID No. 112) to generate pID439. A 1.3 kb fragment containing svr and
orf2 plus 287 bp upstream sequence of orf2 was amplified by PCR using
primers 5'-GTTGGATCCGCTGTTGTTACTTTGATGC-3' (SEQ ID No. 113)
and 5'-TGGAAGCTTACATTACTTCAAATAAATTA-3' (SEQ ID No. 114) to
generate pID4311. A 2.4 kb fragment containing svr, orf2 and orf3 plus 287 bp
upstream sequence of orf2 was amplified by PCR using primers
5'-GTTGGATCCGCTGTTGTTACTTTGATGC-3' (SEQ ID No. 115) and
5'-TGGAAGCTTACACATATGCCAATCTCAC-3' (SEQ ID No. 116) to generate
pID4312. All four plasmids were individually transformed into P6C63
by electroporation. Transformants were selected by resistance to tetracycline and
tested for restoration of wild-type phenotype.
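
Because eight primers are listed for four constructs, it can help to tabulate which primer pair builds which plasmid. The Python sketch below is illustrative only: it records the pairs as given in the text (with OCR spacing removed from the sequences), checks that every forward primer carries a BamHI site and every reverse primer a HindIII site, and reports which SEQ ID numbers share an identical sequence. The dictionary layout and function names are assumptions.

```python
from collections import defaultdict

BAMHI = "GGATCC"    # site introduced at the 5' end of each fragment
HINDIII = "AAGCTT"  # site introduced at the 3' end of each fragment

# Primer sequences keyed by SEQ ID number, transcribed from the text.
PRIMERS = {
    109: "TGGGGATCCGATAAGTGTGACTGGTAG",
    110: "TGGAAGCTTACATTACTTCAAATAAATTA",
    111: "TGGGGATCCTGCATATCAAAATGTTTATGGC",
    112: "TGGAAGCTTACACATATGCCAATCTCAC",
    113: "GTTGGATCCGCTGTTGTTACTTTGATGC",
    114: "TGGAAGCTTACATTACTTCAAATAAATTA",
    115: "GTTGGATCCGCTGTTGTTACTTTGATGC",
    116: "TGGAAGCTTACACATATGCCAATCTCAC",
}

# (forward, reverse) SEQ ID numbers used for each complementing plasmid.
CONSTRUCTS = {
    "pID437": (109, 110),
    "pID439": (111, 112),
    "pID4311": (113, 114),
    "pID4312": (115, 116),
}


def primers_carry_expected_sites():
    """True if each forward primer has BamHI and each reverse has HindIII."""
    return all(BAMHI in PRIMERS[fwd] and HINDIII in PRIMERS[rev]
               for fwd, rev in CONSTRUCTS.values())


def shared_primer_groups():
    """Group SEQ ID numbers whose primer sequences are identical."""
    by_sequence = defaultdict(list)
    for seq_id, sequence in PRIMERS.items():
        by_sequence[sequence].append(seq_id)
    return sorted(sorted(ids) for ids in by_sequence.values() if len(ids) > 1)


if __name__ == "__main__":
    print(primers_carry_expected_sites())
    print(shared_primer_groups())
```

The grouping makes the construct design easier to read: as listed, the four fragments are built from two distinct reverse primers and three distinct forward primers.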

Phenotypic characterisation

Lipase activity was assayed on 1% Tween agar plates (Difco). DNase
production was assayed on DNase agar (Difco). Coagulase was assayed by the
method described by Smeltzer et al. (11). α-, β- and δ-toxins were assayed on
cross-streaked sheep, rabbit and horse blood agar plates. Protein A was detected
by Western blot using anti-protein A monoclonal antibody (Sigma).
Southern hybridization analysis was performed as described by Sambrook et al.
(9) with DIG-labelled DNA fragments as probes. For Northern hybridization,
total RNA from S. aureus was isolated by using the Qiagen RNeasy kit
according to the manufacturer's protocol except that the bacterial cells were
lysed by lysostaphin (Sigma; 200 µg/ml) at 37°C for 3-5 min. Equal amounts of
RNA were separated on 1.2% agarose gels containing 0.66 M formaldehyde and
transferred onto nitrocellulose membrane. Hybridizations were carried out at
42°C. All probes were radiolabeled with [α-32P]dATP by PCR amplification
using PCR-generated DNA fragments as templates. Oligonucleotide primers
used for PCR were as follows: amplification of an hla fragment was performed
with primers H1 (5'-ATTTGATATGTCTCAACTGC-3') (SEQ ID No 117) and
H2 (5'-GCTCTAATTTTTAAGTGAGG-3') (SEQ ID No 118). For
amplification of spa, primers used were S1
(5'-TATCTGGTGGCGTAACACCTG-3') (SEQ ID No 119) and S2
(5'-GATGAAGCCGTTACGTTGTTC-3') (SEQ ID No 120). For agr, primers A1
(5'-GCCATAAGGATGTGAATGTATG-3') (SEQ ID No 121) and A2
(5'-GCATTTGCTAGTTATCTTG-3') (SEQ ID No 122) were used. Primers R1
(5'-AGATCTATCAAGGATGTGATGGTT-3') (SEQ ID No 123) and R2
(5'-GTCATTATACGATTTAGTACAATC-3') (SEQ ID No 124) were used for the
amplification of RNAIII.



Reverse transcription-polymerase chain reaction (RT-PCR)
Total RNA (1 µg) from post-exponential phase cultures of bacteria was reverse
transcribed using a First-strand cDNA synthesis kit (Pharmacia Biotech) according
to the manufacturer's instructions. PCR was performed in a volume of 100 µl
with 10 µl cDNA sample, 200 pmol of each primer, 200 nM of dNTPs and
2.5 U of Taq DNA polymerase (Sigma). PCR products were analysed by
agarose gel electrophoresis.
Infection studies

For single strain infection studies, CD-1 female mice (20 g; Charles River Labs)
were individually injected intraperitoneally with 0.2 ml of a suspension
containing 5 x 10^5 cfu of bacteria and 2% (w/v) Brewer's yeast in BHI broth. At
6, 24, 48, 72, 96 and 120 hours post-injection, two mice were killed from each
group. Dilution series of spleen homogenates were spread over BHI agar plates
and incubated at 37°C overnight. The number of bacterial cfu was counted at
each time point.
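
Converting plate counts from a dilution series into a bacterial load per spleen is simple arithmetic, sketched below in Python. The text does not give the homogenate volume or the volume plated, so both appear here as explicit parameters, and the values in the usage example are placeholders chosen only to show the calculation. The function names are hypothetical.

```python
def cfu_per_spleen(colonies, dilution_factor, plated_volume_ml,
                   homogenate_volume_ml):
    """Back-calculate cfu per spleen from one countable plate.

    colonies             -- colonies counted on the plate
    dilution_factor      -- e.g. 1000 for a 10^-3 dilution of the homogenate
    plated_volume_ml     -- volume spread on the plate (assumed parameter)
    homogenate_volume_ml -- total homogenate volume (assumed parameter)
    """
    if plated_volume_ml <= 0 or homogenate_volume_ml <= 0:
        raise ValueError("volumes must be positive")
    cfu_per_ml = colonies * dilution_factor / plated_volume_ml
    return cfu_per_ml * homogenate_volume_ml


def mean_load(loads):
    """Average load across the mice killed at one time point."""
    return sum(loads) / len(loads)


if __name__ == "__main__":
    # Placeholder numbers: 42 colonies from a 10^-3 dilution, 0.1 ml plated,
    # 1 ml of homogenate per spleen.
    load = cfu_per_spleen(42, 1000, 0.1, 1.0)
    print(f"{load:.2e} cfu per spleen")
```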


RESULTS 

Cloning and sequencing of the svr region 

To clone the chromosomal region surrounding the transposon insertion site of
strain P6C63, a genomic library of S. aureus was generated in plasmid
pBR322. By hybridising colonies from this genomic library with a probe
consisting of a 0.5 kb DNA fragment flanking the transposon insertion, one positive
clone was identified. Restriction analysis of this recombinant plasmid revealed a
2.5 kb AluI fragment which had been disrupted by Tn917 in P6C63. SEQ ID
Nos: 8 and 10 are the nucleotide and amino acid sequences initially identified
for P6C63. Transposon Tn917 had inserted into one of three open reading
frames (orfs) in this fragment. This orf was designated svr (Fig. 1A). It is 372
bp in length and codes for a protein of 124 amino acid residues (Fig. 1B; SEQ
ID No: 125). Upon further sequencing of P6C63 the nucleotide and amino acid
sequences were determined as SEQ ID No: 126 and SEQ ID No: 127. FASTA
and BLAST searches of the protein databases revealed that svr and orf3 had no
significant similarity to known proteins or motifs. However, orf2, immediately
upstream of svr, showed significant similarity (28% identity over 148 amino
acid residues) to the Escherichia coli slyA, which is a member of the mar
family of transcription factors of Bacillus subtilis.






WO 99/01473 PCT/GB98/01974 


To establish whether the virulence defect of strain P6C63 was due to
interruption of svr or a polar effect on orf3 or other genes of a possible operon,
a series of plasmids was constructed and transformed into P6C63 to test for
complementation of defects in α-, β- and δ-toxin production (see below). The
insert in the smallest complementing plasmid (pID437) contained only the svr
gene (Fig. 1A), indicating that the toxin-deficient phenotype is due to mutation
of the svr gene. When plasmids pID437, pID439, pID4311 and pID4312 were
transformed into P6C63, only a proportion of transformants showed complete
restoration of the wild-type phenotype. A Southern analysis was performed on
these transformants to test for the presence of the plasmid in the complemented
strains, using the svr gene as the probe. Southern hybridization showed that in
the complemented transformants, the plasmid had integrated into the
chromosomal DNA (data not shown), whereas the uncomplemented
transformants carried unintegrated plasmids.

To determine whether other strains of S. aureus, particularly clinical
isolates, also contain the svr gene, a Southern hybridization analysis was
performed on S. aureus strains RN6390, Newman, Wood 46, ID401 and
ID402, which were from different clinical sources in the USA and UK.
Chromosomal DNA from each strain was digested with HindIII and probed
with the svr gene. A common 8.2 kb fragment was observed in strains RN6390,
Newman, Wood 46 and ID401, whereas a 10 kb fragment was observed in
strain ID402 (Fig. 1C). This indicates that svr is widely conserved in S. aureus.



Phenotypic characterisation 

To investigate the svr phenotype in more detail, P6C63 was subjected to a
number of tests for S. aureus virulence determinants. The expression levels of
α-, β- and δ-toxins were examined on different blood agar plates. The
expression levels of α-, β- and δ-toxins are greatly reduced in P6C63 and
RN6911 (an agr- strain) compared with their parental strain RN6390, and the
production of toxins was restored in P6C63 when transformed with pID437
(Fig. 2A).
A characteristic of agr- strains is the overproduction of cell wall protein
A. In view of the similar toxin-deficient phenotype of agr- and svr- strains, we
examined protein A by Western blot using an anti-protein A monoclonal
antibody. As shown in Fig. 2B, protein A was not detectable in the wild-type
strain, was barely detectable in the complemented strain, and was present in
both agr- and svr- strains.

Virulence studies 

Mutant P6C63 was originally isolated by STM in a pool of 96 mutants and
LD50 analysis has shown that it is highly attenuated in virulence compared with
the wild-type strain RN6390 (7). To study the growth kinetics of P6C63 in
more detail, strains RN6390 and P6C63 were injected intraperitoneally into
groups of CD-1 mice at a dose of 1 x 10^5 cfu of bacteria. Bacteria were
recovered from spleens at different time points and cfu quantified. The bacterial
load of RN6390 in spleens exceeded 10^6 cfu at 6, 24, 48 and 72 hours post-
inoculation. The numbers of P6C63 cells increased over the first 6 hours, then
decreased, and were eventually cleared by 96 hours. Mice inoculated with
RN6390 appeared very sick compared with those injected with the mutant strain
and one of 18 mice died in 24 hours.

Transcriptional analysis of svr 

As attempts to detect svr mRNA in RN6390 by Northern hybridization were
not successful, RT-PCR was employed to determine the transcription of svr in
various genetic backgrounds. Total RNA from P6C63 (svr-), RN6911 (agr-)
and RN6390 (wild-type) was subjected to RT-PCR using primers
corresponding to the svr DNA sequence. As shown in Fig. 4A, svr transcripts
were detected in RN6911 and RN6390 but not in P6C63. A control RT-PCR
was performed using the same conditions as above except that reverse
transcriptase was inactivated at 95°C for 5 min. No PCR products were obtained
for any of the three strains, indicating that the products shown in Fig. 4A were
not due to DNA contamination.

Effect of SVR on transcription of hla and spa

As the svr mutant strain showed reduced production of α-, β- and δ-toxins and
increased levels of protein A, Northern hybridizations were performed to
determine whether they were affected at the mRNA level. Total RNA extracted
from post-exponential phase cultures of RN6390, RN6911, P6C63 and the
complemented strain P6C63 (pID437) was subjected to Northern analysis
using probes specific for hla and spa. With the hla probe, a strong band was
observed in RNA from the wild-type strain RN6390 and the complemented
strain but was not observed in either the agr- strain RN6911 or the svr- strain
P6C63. The membranes were stripped and reprobed with the spa gene, and in
contrast to hla, a hybridizing band was identified in RNA from RN6911 and
P6C63 but not in strain RN6390 or the complemented strain (Fig. 4B). These
results are consistent with the protein analysis (above) and indicate that svr
affected the expression of α-toxin and protein A at the mRNA level.

Svr is required for the transcription of agr and RNAIII 
Since phenotypic analysis and Northern hybridization results indicated that the
svr mutant has similar characteristics to those of an agr- strain, Northern
hybridizations were performed to investigate whether svr is related to the agr
regulatory system. Total RNA isolated from post-exponential phase cultures of
RN6390, RN6911, P6C63 and the complemented strain was subjected to
Northern hybridizations using probes specific for agr and RNAIII. As shown in
Fig. 4C, both the agr and RNAIII probes hybridized to RNA isolated from the
wild-type strain and complemented strain, but did not hybridize to RNA from
the svr mutant strain or the agr- strain. This result suggests that svr is required
for the transcription of agr and RNAIII.
Using STM, an approach for the identification of bacterial virulence genes (7),
we have identified svr, a novel staphylococcal virulence regulator whose mutant
phenotype is similar to that of an agr- strain. However, DNA sequence
analysis of svr shows that it is not a member of the known agr regulatory system,
and has no similarity to known proteins in the DNA and protein databases. The
svr- mutant phenotype was complemented by a plasmid containing svr,
indicating that the phenotypic defect was not due to a polar effect on genes
downstream of svr. Interestingly, complementation occurred in cells containing
chromosomally integrated plasmid, but not in cells containing non-integrated
plasmids. The reason for this is unclear, but as the plasmid copy number is in
the order of 15-25/cell (10), it could be related to over-expression of svr.
Certainly the abundance of svr mRNA seems to be very low in wild-type cells,
as it was not detectable by Northern hybridization using RNA recovered from
bacteria at various stages of growth.

The phenotypic and Northern hybridization analysis of toxins and protein
A in the svr mutant showed a pattern similar to that of an agr- mutant strain.
We therefore asked whether svr is linked with the agr regulatory system. The svr
mRNA was detected in an agr- strain and the wild-type strain but not in the svr-
strain, suggesting that agr has no effect on svr activity. By contrast, agr
mRNA and RNAIII were detected in the wild-type strain and svr-complemented
strain, but not in the svr- strain. This suggests that svr is required for the
expression of agr and RNAIII.
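
The reasoning in the paragraph above is a standard epistasis argument, and it can be written out explicitly. The Python sketch below encodes the transcript and protein observations reported in the Results (detected = True, not detected = False) and applies one rule: a gene is placed upstream of another if the second gene's transcript is lost in the first gene's mutant while the first gene's transcript persists in the second gene's mutant. The data structure and function name are assumptions made for illustration, and the rule is a simplification that ignores feedback or partial effects.

```python
# Observations as reported in the Results for three strains.
OBSERVED = {
    "wild-type (RN6390)": {"svr": True, "agr": True, "RNAIII": True,
                           "hla": True, "spa": False},
    "agr- (RN6911)": {"svr": True, "agr": False, "RNAIII": False,
                      "hla": False, "spa": True},
    "svr- (P6C63)": {"svr": False, "agr": False, "RNAIII": False,
                     "hla": False, "spa": True},
}

# Which strain is the mutant for which regulator.
MUTANT_OF = {"svr": "svr- (P6C63)", "agr": "agr- (RN6911)"}


def acts_upstream(regulator, target):
    """Apply the simplified epistasis rule described in the lead-in."""
    target_lost = not OBSERVED[MUTANT_OF[regulator]][target]
    regulator_persists = OBSERVED[MUTANT_OF[target]][regulator]
    return target_lost and regulator_persists


def same_downstream_pattern(strain_a, strain_b, readouts=("hla", "spa")):
    """True if two strains agree on the listed downstream read-outs."""
    return all(OBSERVED[strain_a][r] == OBSERVED[strain_b][r]
               for r in readouts)


if __name__ == "__main__":
    print("svr upstream of agr:", acts_upstream("svr", "agr"))
    print("agr upstream of svr:", acts_upstream("agr", "svr"))
    print("mutants share hla/spa pattern:",
          same_downstream_pattern("agr- (RN6911)", "svr- (P6C63)"))
```

Under this rule svr is placed upstream of agr and not the reverse, which matches the conclusion drawn in the text.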

It was recently shown that agr is autoinduced by a proteinaceous factor
called RAP (RNAIII activating protein), which is produced and secreted by S.
aureus (16). Although the sequence of RAP has not been published, it seems
unlikely that svr encodes the RAP protein, because the length of svr is 372 bp,
which could encode a 13 kDa protein, and RAP is a 38 kDa protein (16). The
fact that sar is also necessary for the expression of agr and RNAIII (6) indicates
that three distinct regulatory pathways could influence the expression of the agr
locus and virulence factors of S. aureus. Studies on the relationship between svr,
sar and RAP will help to clarify the mechanisms by which S. aureus controls
virulence gene expression.
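
The size argument above (a 372 bp gene cannot encode a 38 kDa protein) can be reproduced with a rough calculation. The Python sketch below uses the common rule of thumb of about 110 Da per amino acid residue; that average is an assumption of this sketch, not a figure from the text, and real masses depend on composition. The function names are hypothetical.

```python
import math

# Rule-of-thumb average residue mass (assumption; actual values vary).
AVERAGE_RESIDUE_MASS_DA = 110.0


def max_residues(orf_length_bp):
    """Number of whole codons in an open reading frame of this length."""
    return orf_length_bp // 3


def approx_mass_kda(n_residues):
    """Approximate protein mass from residue count."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


def min_orf_bp_for_mass(mass_kda):
    """Smallest coding length (bp) that could yield a protein of this mass."""
    residues = math.ceil(mass_kda * 1000.0 / AVERAGE_RESIDUE_MASS_DA)
    return residues * 3


if __name__ == "__main__":
    residues = max_residues(372)
    print(residues, "residues, about", approx_mass_kda(residues), "kDa")
    print("a 38 kDa protein needs at least", min_orf_bp_for_mass(38.0), "bp")
```

On these assumptions, 372 bp gives 124 codons and roughly 13.6 kDa, in line with the 124 residues and 13 kDa stated in the text, while a 38 kDa protein would need a coding region of roughly 1 kb.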

The recent emergence of methicillin resistant S. aureus (MRSA)
represents a very important public health problem (18). Thus there is an urgent
need for alternative approaches to control S. aureus infections. Balaban et al.
(17) reported that mice vaccinated with RAP were protected from infection by a
subsequent S. aureus challenge. The fact that svr is required for virulence, most
likely through regulation of agr and RNAIII, suggests that svr might be another
target for the development of antibiotics and vaccines against S. aureus
infections.
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Numerous modifications and variations of the above-described 
invention are expected to occur to those of skill in the art. Accordingly, only 
15 such limitations as appear in the appended claims should be placed thereon. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: Holden, David W. 
(iil TITLE OF INVENTION: ANTI -BACTERIAL METHODS AND MATERIALS 



(iii) NUMBER OF SEQUENCES: 12 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Eric Potter Clarkson 

(B) STREET: Park View House, 58 The Ropewalk 

(C) CITY: Nottingham 

(D) STATE: 

(E) COUNTRY: United Kingdom 

(F) ZIP: NG1 5DD 

(v) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Miles, John Stephen
(B) REGISTRATION NUMBER:

(C) REFERENCE/DOCKET NUMBER: RPMW/ PI 9 192PC

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +44 115 955 2211 

(B) TELEFAX: 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (p2c73)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1257

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATG AAA TTT ACA GAG TTA ACT GTT ACC GAA TTT GAC AAC TTT GTA CAA 48
Met Lys Phe Thr Glu Leu Thr Val Thr Glu Phe Asp Asn Phe Val Gln
1 5 10 15

AAT CCA TCA TTG GAA AGT CAT TAT TTC CAA GTA AAA GAA AAT ATA GTT 96
Asn Pro Ser Leu Glu Ser His Tyr Phe Gln Val Lys Glu Asn Ile Val
20 25 30

ACC CGT GAG AAT GAT GGC TTT GAA GTA GTT TTA TTA GGT ATT AAA GAC 144
Thr Arg Glu Asn Asp Gly Phe Glu Val Val Leu Leu Gly Ile Lys Asp
35 40 45

GAC AAT AAC AAA GTA ATT GCA GCA AGC CTT TTC TCT AAA ATT CCT ACT 192
Asp Asn Asn Lys Val Ile Ala Ala Ser Leu Phe Ser Lys Ile Pro Thr
50 55 60

ATG GGA AGT TAT GTT TAC TAT TCG AAT CGT GGT CCA GTA ATG GAT TTT 240
Met Gly Ser Tyr Val Tyr Tyr Ser Asn Arg Gly Pro Val Met Asp Phe
65 70 75 80

TCA GAT TTA GGA TTA GTT GAT TAT TAT TTA AAA GAG TTA GAT AAA TAT 288
Ser Asp Leu Gly Leu Val Asp Tyr Tyr Leu Lys Glu Leu Asp Lys Tyr
85 90 95

TTA CAG CAA CAT CAA TGT TTA TAT GTT AAA TTA GAT CCG TAT TGG TTA 336
Leu Gln Gln His Gln Cys Leu Tyr Val Lys Leu Asp Pro Tyr Trp Leu
100 105 110

TAT CAT CTA TAT GAT AAA GAT ATC GTG CCA TTT GAA GGT CGC GAG AAA 384
Tyr His Leu Tyr Asp Lys Asp Ile Val Pro Phe Glu Gly Arg Glu Lys
115 120 125

AAT GAT GCC CTA GTA AAC TTG TTT AAA TCA CAT GGT TAC GAG CAT CAT 432
Asn Asp Ala Leu Val Asn Leu Phe Lys Ser His Gly Tyr Glu His His
130 135 140

GGC TTT ACA ACT GAG TAT GAT ACA TCG AGC CAA GTA CGA TGG ATG GGC 480
Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser Gln Val Arg Trp Met Gly
145 150 155 160

GTA TTA AAC CTT GAA GGT AAA ACA CCC GAA ACA TTG AAA AAG ACA TTT 528
Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr Leu Lys Lys Thr Phe
165 170 175

GAT AGT CAA CGT AAA CGT AAT ATT AAT AAA GCG ATA AAC TAT GGT GTT 576
Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala Ile Asn Tyr Gly Val
180 185 190

AAA GTC AGA TTC CTT GAA CGT GAT GAG TTC AAT CTT TTC TTA GAT TTA 624
Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn Leu Phe Leu Asp Leu
195 200 205

TAT CGT GAA ACT GAA GAG CGT GCT GGA TTT GTA TCA AAA ACA GAT GAT 672
Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val Ser Lys Thr Asp Asp
210 215 220

TAT TTT TAT AAC TTT ATT GAC ACA TAT GGA GAT AAA GTA TTA GTA CCA 720
Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp Lys Val Leu Val Pro
225 230 235 240

TTA GCA TAT ATT GAC CTT GAT GAA TAT GTG TTA AAG TTG CAA CAG GAA 768
Leu Ala Tyr Ile Asp Leu Asp Glu Tyr Val Leu Lys Leu Gln Gln Glu
245 250 255

TTG AAT GAC AAA GAA AAT CGT CGT GAT CAA ATG ATG GCG AAA GAA AAC 816
Leu Asn Asp Lys Glu Asn Arg Arg Asp Gln Met Met Ala Lys Glu Asn
260 265 270

AAA TCA GAT AAG CAA ATG AAG AAA ATT GCA GAA TTA GAT AAG CAA ATT 864
Lys Ser Asp Lys Gln Met Lys Lys Ile Ala Glu Leu Asp Lys Gln Ile
275 280 285

GAT CAT GAT CAG CAT GAA TTA TTG AAT GCA AGT GAA TTG AGC AAA ACG 912
Asp His Asp Gln His Glu Leu Leu Asn Ala Ser Glu Leu Ser Lys Thr
290 295 300

GAC GGC CCA ATT CTA AAC CTT GCT TCT GGC GTT TAT TTT GCA AAT GCA 960
Asp Gly Pro Ile Leu Asn Leu Ala Ser Gly Val Tyr Phe Ala Asn Ala
305 310 315 320

TAT GAA GTG AAT TAT TTC TCT GGT GGT TCA TCA GAA AAA TAT AAT CAA 1008
Tyr Glu Val Asn Tyr Phe Ser Gly Gly Ser Ser Glu Lys Tyr Asn Gln
325 330 335

TTT ATG GGA CCA TAC ATG ATG CAT TGG TTT ATG ATT AAC TAT TGC TTC 1056
Phe Met Gly Pro Tyr Met Met His Trp Phe Met Ile Asn Tyr Cys Phe
340 345 350

GAT AAT GGC TAT GAT CGT TAT AAT TTC TAT GGT TTA TCA GGT GAT TTT 1104
Asp Asn Gly Tyr Asp Arg Tyr Asn Phe Tyr Gly Leu Ser Gly Asp Phe
355 360 365

ACG GAA AAC AGT GAA GAT TAT GGC GTA TAC CGC TTT AAA CGT GGA TTT 1152
Thr Glu Asn Ser Glu Asp Tyr Gly Val Tyr Arg Phe Lys Arg Gly Phe
370 375 380

AAT GTA CAA ATC GAA GAA TTA ATA GGG GAT TTC TAT AAA CCA ATT CAT 1200
Asn Val Gln Ile Glu Glu Leu Ile Gly Asp Phe Tyr Lys Pro Ile His
385 390 395 400

AAA GTG AAA TAT TGG TTG TTC ACA ACA TTG GAT AAA TTA CGT AAA AAA 1248
Lys Val Lys Tyr Trp Leu Phe Thr Thr Leu Asp Lys Leu Arg Lys Lys
405 410 415

TTA AAG AAA TAG 1260
Leu Lys Lys
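
Each nucleic acid entry in this listing gives a CDS location, and the translated protein is then repeated as a separate entry with a stated length in amino acids. A minimal sketch of a consistency check between the two is given below. The coordinates and residue counts are read from the entries for SEQ ID NOs 1, 2, 4, 5, 8 to 14 in this listing; the function and variable names are illustrative and not part of the specification.

```python
def cds_residue_count(start: int, end: int) -> int:
    """Number of codons spanned by a CDS given 1-based inclusive coordinates."""
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError(f"CDS {start}..{end} is not a whole number of codons")
    return span // 3


# (CDS start, CDS end, residues stated for the corresponding protein entry)
listing = {
    "SEQ ID NO: 1 -> 2": (1, 1257, 419),
    "SEQ ID NO: 4 -> 5": (1, 726, 242),
    "SEQ ID NO: 8 -> 9": (1, 417, 139),
    "SEQ ID NO: 8 -> 10": (524, 730, 69),
    "SEQ ID NO: 11 -> 12": (1, 825, 275),
    "SEQ ID NO: 13 -> 14": (1, 525, 175),
}

for name, (start, end, stated) in listing.items():
    computed = cds_residue_count(start, end)
    status = "ok" if computed == stated else "MISMATCH"
    print(f"{name}: {computed} codons, {stated} residues stated ({status})")
```

In each case the CDS range stops three bases short of the sequence end where a stop codon is present (for example 1..1257 in a 1260 bp sequence), so the codon count equals the stated protein length.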



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Phe Thr Glu Leu Thr Val Thr Glu Phe Asp Asn Phe Val Gln
1 5 10 15

Asn Pro Ser Leu Glu Ser His Tyr Phe Gln Val Lys Glu Asn Ile Val
20 25 30

Thr Arg Glu Asn Asp Gly Phe Glu Val Val Leu Leu Gly Ile Lys Asp
35 40 45

Asp Asn Asn Lys Val Ile Ala Ala Ser Leu Phe Ser Lys Ile Pro Thr
50 55 60

Met Gly Ser Tyr Val Tyr Tyr Ser Asn Arg Gly Pro Val Met Asp Phe
65 70 75 80

Ser Asp Leu Gly Leu Val Asp Tyr Tyr Leu Lys Glu Leu Asp Lys Tyr
85 90 95

Leu Gln Gln His Gln Cys Leu Tyr Val Lys Leu Asp Pro Tyr Trp Leu
100 105 110

Tyr His Leu Tyr Asp Lys Asp Ile Val Pro Phe Glu Gly Arg Glu Lys
115 120 125

Asn Asp Ala Leu Val Asn Leu Phe Lys Ser His Gly Tyr Glu His His
130 135 140

Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser Gln Val Arg Trp Met Gly
145 150 155 160

Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr Leu Lys Lys Thr Phe
165 170 175

Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala Ile Asn Tyr Gly Val
180 185 190

Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn Leu Phe Leu Asp Leu
195 200 205

Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val Ser Lys Thr Asp Asp
210 215 220

Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp Lys Val Leu Val Pro
225 230 235 240

Leu Ala Tyr Ile Asp Leu Asp Glu Tyr Val Leu Lys Leu Gln Gln Glu
245 250 255

Leu Asn Asp Lys Glu Asn Arg Arg Asp Gln Met Met Ala Lys Glu Asn
260 265 270

Lys Ser Asp Lys Gln Met Lys Lys Ile Ala Glu Leu Asp Lys Gln Ile
275 280 285

Asp His Asp Gln His Glu Leu Leu Asn Ala Ser Glu Leu Ser Lys Thr
290 295 300

Asp Gly Pro Ile Leu Asn Leu Ala Ser Gly Val Tyr Phe Ala Asn Ala
305 310 315 320

Tyr Glu Val Asn Tyr Phe Ser Gly Gly Ser Ser Glu Lys Tyr Asn Gln
325 330 335

Phe Met Gly Pro Tyr Met Met His Trp Phe Met Ile Asn Tyr Cys Phe
340 345 350

Asp Asn Gly Tyr Asp Arg Tyr Asn Phe Tyr Gly Leu Ser Gly Asp Phe
355 360 365

Thr Glu Asn Ser Glu Asp Tyr Gly Val Tyr Arg Phe Lys Arg Gly Phe
370 375 380

Asn Val Gln Ile Glu Glu Leu Ile Gly Asp Phe Tyr Lys Pro Ile His
385 390 395 400

Lys Val Lys Tyr Trp Leu Phe Thr Thr Leu Asp Lys Leu Arg Lys Lys
405 410 415

Leu Lys Lys

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p2c90)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GACGTGCTGA TGCAAATATA AGTNCATGAC CATCATGTTG TAATTGTNAC AATGTATCAA 60

TAATAGTCTG GTCAATTAAT CGGCCGTCAA AACAAAGCGT ACCATCAATA TCANAGACAA 120

ATCTCATCAN ATCACTCCAA ACAATATAAT ACCATGATTA TAGCATAAGT NAGTCATGTN 180

ACGATATTAG ATAATGATTA TNATGTAGGG TACCTTTTGC CTTACACACA TATNACTTCC 240

TATNATATAT ATTATGTCAA CNNGAATGTN AAATTCCATA AGGGGACTTA TATAACTGTN 300

TGTCTGTNTA GTGTTTATGT CAGTCAGCTA AATTNACATT CATGTTATGT CTCATTAAAC 360

CAATTACTCA CGTNTTGGTG CATATCNCAT CTTTCATATC GTCATACATC TATCCTCATT 420

CTCNTGNCTG A 431



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p9b74)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..726

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

ATG ACT AAA TTA TTT ATA CCT TAT ATT ATG GGC AAT AAA GAT TTG ATT 48
Met Thr Lys Leu Phe Ile Pro Tyr Ile Met Gly Asn Lys Asp Leu Ile
1 5 10 15

GAA AAT GCA ACA TTG TTG AGT GAA AAT GGT GCA GAT ATA ATT GAA ATT 96
Glu Asn Ala Thr Leu Leu Ser Glu Asn Gly Ala Asp Ile Ile Glu Ile
20 25 30

GGA GTA CCT TTC TCT GAT CCG GTT GCT GAT GGT CCA GTT ATC ATG GAA 144
Gly Val Pro Phe Ser Asp Pro Val Ala Asp Gly Pro Val Ile Met Glu
35 40 45

GCA GGT CAA CAA GCG ATT AAA CAA GGC ATC ACG ATA GAT TAT ATT TTC 192
Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile Thr Ile Asp Tyr Ile Phe
50 55 60

AAT CAA TTA GAA AAA CAT GGT GAT CAA ATT AAG TGT AAC TAT GTA TTA 240
Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys Cys Asn Tyr Val Leu
65 70 75 80

ATG ACG TAT TAT AAT ATT ATT TGT CAT TAT GGA GAA CAA GCG TTT TTT 288
Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly Glu Gln Ala Phe Phe
85 90 95

GAA AAA TGT CGA GAT ACT GGT GTC TAC GGC TTA ATT ATT CCT GAT TTA 336
Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu Ile Ile Pro Asp Leu
100 105 110

CCA TAT GAA TTA TCG CAG CGT TTA AAA CAA CAA TTT AGT CAC TAT GGC 384
Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln Gln Phe Ser His Tyr Gly
115 120 125

GTC AAA ATC ATA TCG TTA GTT GCG ATG ACT ACT GAT GAC AAA CGT ATA 432
Val Lys Ile Ile Ser Leu Val Ala Met Thr Thr Asp Asp Lys Arg Ile
130 135 140

AAA GAT ATC GTA TCC CAT GCG GAA GGC TTT ATT TAT ACT GTG ACG ATG 480
Lys Asp Ile Val Ser His Ala Glu Gly Phe Ile Tyr Thr Val Thr Met
145 150 155 160

AAT GCG ACA ACA GGG CAA AAC GGT GCG TTT CAT CCA GAA TTA AAA CGA 528
Asn Ala Thr Thr Gly Gln Asn Gly Ala Phe His Pro Glu Leu Lys Arg
165 170 175

AAA ATT GAG TCA ATT AAA GCG ATA GCC AAT GTG CCA GTT GTC GCA GGA 576
Lys Ile Glu Ser Ile Lys Ala Ile Ala Asn Val Pro Val Val Ala Gly
180 185 190

TTT GGT ATA AGA ACA CCA CAA CAT GTT GCA GAT ATA AAA GAG GTT GCA 624
Phe Gly Ile Arg Thr Pro Gln His Val Ala Asp Ile Lys Glu Val Ala
195 200 205

GAT GGC ATT GTC ATT GGT AGC GAA ATC GTT AAG CGA TTT AAA TCT AAC 672
Asp Gly Ile Val Ile Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn
210 215 220

ACG CGT GAG GAA ATC ATT AAA TAT TTA CAA TCT ATC CAA CAA ACA TTG 720
Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu
225 230 235 240

AAT AAT TAA 729
Asn Asn



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Thr Lys Leu Phe Ile Pro Tyr Ile Met Gly Asn Lys Asp Leu Ile
1 5 10 15



WO 99/01473 



-57- 



PCT/GB98/01974 



Glu Asn Ala Thr Leu Leu Ser Glu Asn Gly Ala Asp Ile Ile Glu Ile
20 25 30

Gly Val Pro Phe Ser Asp Pro Val Ala Asp Gly Pro Val Ile Met Glu
35 40 45

Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile Thr Ile Asp Tyr Ile Phe
50 55 60

Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys Cys Asn Tyr Val Leu
65 70 75 80

Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly Glu Gln Ala Phe Phe
85 90 95

Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu Ile Ile Pro Asp Leu
100 105 110

Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln Gln Phe Ser His Tyr Gly
115 120 125

Val Lys Ile Ile Ser Leu Val Ala Met Thr Thr Asp Asp Lys Arg Ile
130 135 140

Lys Asp Ile Val Ser His Ala Glu Gly Phe Ile Tyr Thr Val Thr Met
145 150 155 160

Asn Ala Thr Thr Gly Gln Asn Gly Ala Phe His Pro Glu Leu Lys Arg
165 170 175

Lys Ile Glu Ser Ile Lys Ala Ile Ala Asn Val Pro Val Val Ala Gly
180 185 190

Phe Gly Ile Arg Thr Pro Gln His Val Ala Asp Ile Lys Glu Val Ala
195 200 205

Asp Gly Ile Val Ile Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn
210 215 220

Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu
225 230 235 240

Asn Asn



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (p11c29/p13c83)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1257

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

ATG AAA TTT ACA GAG TTA ACT GTT ACC GAA TTT GAC AAC TTT GTA CAA 48
Met Lys Phe Thr Glu Leu Thr Val Thr Glu Phe Asp Asn Phe Val Gln
1 5 10 15

AAT CCA TCA TTG GAA AGT CAT TAT TTC CAA GTA AAA GAA AAT ATA GTT 96
Asn Pro Ser Leu Glu Ser His Tyr Phe Gln Val Lys Glu Asn Ile Val
20 25 30

ACC CGT GAG AAT GAT GGC TTT GAA GTA GTT TTA TTA GGT ATT AAA GAC 144
Thr Arg Glu Asn Asp Gly Phe Glu Val Val Leu Leu Gly Ile Lys Asp
35 40 45

GAC AAT AAC AAA GTA ATT GCA GCA AGC CTT TTC TCT AAA ATT CCT ACT 192
Asp Asn Asn Lys Val Ile Ala Ala Ser Leu Phe Ser Lys Ile Pro Thr
50 55 60

ATG GGA AGT TAT GTT TAC TAT TCG AAT CGT GGT CCA GTA ATG GAT TTT 240
Met Gly Ser Tyr Val Tyr Tyr Ser Asn Arg Gly Pro Val Met Asp Phe
65 70 75 80

TCA GAT TTA GGA TTA GTT GAT TAT TAT TTA AAA GAG TTA GAT AAA TAT 288
Ser Asp Leu Gly Leu Val Asp Tyr Tyr Leu Lys Glu Leu Asp Lys Tyr
85 90 95

TTA CAG CAA CAT CAA TGT TTA TAT GTT AAA TTA GAT CCG TAT TGG TTA 336
Leu Gln Gln His Gln Cys Leu Tyr Val Lys Leu Asp Pro Tyr Trp Leu
100 105 110

TAT CAT CTA TAT GAT AAA GAT ATC GTG CCA TTT GAA GGT CGC GAG AAA 384
Tyr His Leu Tyr Asp Lys Asp Ile Val Pro Phe Glu Gly Arg Glu Lys
115 120 125

AAT GAT GCC CTA GTA AAC TTG TTT AAA TCA CAT GGT TAC GAG CAT CAT 432
Asn Asp Ala Leu Val Asn Leu Phe Lys Ser His Gly Tyr Glu His His
130 135 140

GGC TTT ACA ACT GAG TAT GAT ACA TCG AGC CAA GTA CGA TGG ATG GGC 480
Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser Gln Val Arg Trp Met Gly
145 150 155 160

GTA TTA AAC CTT GAA GGT AAA ACA CCC GAA ACA TTG AAA AAG ACA TTT 528
Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr Leu Lys Lys Thr Phe
165 170 175

GAT AGT CAA CGT AAA CGT AAT ATT AAT AAA GCG ATA AAC TAT GGT GTT 576
Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala Ile Asn Tyr Gly Val
180 185 190

AAA GTC AGA TTC CTT GAA CGT GAT GAG TTC AAT CTT TTC TTA GAT TTA 624
Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn Leu Phe Leu Asp Leu
195 200 205

TAT CGT GAA ACT GAA GAG CGT GCT GGA TTT GTA TCA AAA ACA GAT GAT 672
Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val Ser Lys Thr Asp Asp
210 215 220

TAT TTT TAT AAC TTT ATT GAC ACA TAT GGA GAT AAA GTA TTA GTA CCA 720
Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp Lys Val Leu Val Pro
225 230 235 240

TTA GCA TAT ATT GAC CTT GAT GAA TAT GTG TTA AAG TTG CAA CAG GAA 768
Leu Ala Tyr Ile Asp Leu Asp Glu Tyr Val Leu Lys Leu Gln Gln Glu
245 250 255

TTG AAT GAC AAA GAA AAT CGT CGT GAT CAA ATG ATG GCG AAA GAA AAC 816
Leu Asn Asp Lys Glu Asn Arg Arg Asp Gln Met Met Ala Lys Glu Asn
260 265 270

AAA TCA GAT AAG CAA ATG AAG AAA ATT GCA GAA TTA GAT AAG CAA ATT 864
Lys Ser Asp Lys Gln Met Lys Lys Ile Ala Glu Leu Asp Lys Gln Ile
275 280 285

GAT CAT GAT CAG CAT GAA TTA TTG AAT GCA AGT GAA TTG AGC AAA ACG 912
Asp His Asp Gln His Glu Leu Leu Asn Ala Ser Glu Leu Ser Lys Thr
290 295 300

GAC GGC CCA ATT CTA AAC CTT GCT TCT GGC GTT TAT TTT GCA AAT GCA 960
Asp Gly Pro Ile Leu Asn Leu Ala Ser Gly Val Tyr Phe Ala Asn Ala
305 310 315 320

TAT GAA GTG AAT TAT TTC TCT GGT GGT TCA TCA GAA AAA TAT AAT CAA 1008
Tyr Glu Val Asn Tyr Phe Ser Gly Gly Ser Ser Glu Lys Tyr Asn Gln
325 330 335

TTT ATG GGA CCA TAC ATG ATG CAT TGG TTT ATG ATT AAC TAT TGC TTC 1056
Phe Met Gly Pro Tyr Met Met His Trp Phe Met Ile Asn Tyr Cys Phe
340 345 350

GAT AAT GGC TAT GAT CGT TAT AAT TTC TAT GGT TTA TCA GGT GAT TTT 1104
Asp Asn Gly Tyr Asp Arg Tyr Asn Phe Tyr Gly Leu Ser Gly Asp Phe
355 360 365

ACG GAA AAC AGT GAA GAT TAT GGC GTA TAC CGC TTT AAA CGT GGA TTT 1152
Thr Glu Asn Ser Glu Asp Tyr Gly Val Tyr Arg Phe Lys Arg Gly Phe
370 375 380

AAT GTA CAA ATC GAA GAA TTA ATA GGG GAT TTC TAT AAA CCA ATT CAT 1200
Asn Val Gln Ile Glu Glu Leu Ile Gly Asp Phe Tyr Lys Pro Ile His
385 390 395 400

AAA GTG AAA TAT TGG TTG TTC ACA ACA TTG GAT AAA TTA CGT AAA AAA 1248
Lys Val Lys Tyr Trp Leu Phe Thr Thr Leu Asp Lys Leu Arg Lys Lys
405 410 415

TTA AAG AAA TAG 1260
Leu Lys Lys



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Lys Phe Thr Glu Leu Thr Val Thr Glu Phe Asp Asn Phe Val Gln
1 5 10 15

Asn Pro Ser Leu Glu Ser His Tyr Phe Gln Val Lys Glu Asn Ile Val
20 25 30

Thr Arg Glu Asn Asp Gly Phe Glu Val Val Leu Leu Gly Ile Lys Asp
35 40 45

Asp Asn Asn Lys Val Ile Ala Ala Ser Leu Phe Ser Lys Ile Pro Thr
50 55 60

Met Gly Ser Tyr Val Tyr Tyr Ser Asn Arg Gly Pro Val Met Asp Phe
65 70 75 80

Ser Asp Leu Gly Leu Val Asp Tyr Tyr Leu Lys Glu Leu Asp Lys Tyr
85 90 95

Leu Gln Gln His Gln Cys Leu Tyr Val Lys Leu Asp Pro Tyr Trp Leu
100 105 110

Tyr His Leu Tyr Asp Lys Asp Ile Val Pro Phe Glu Gly Arg Glu Lys
115 120 125

Asn Asp Ala Leu Val Asn Leu Phe Lys Ser His Gly Tyr Glu His His
130 135 140

Gly Phe Thr Thr Glu Tyr Asp Thr Ser Ser Gln Val Arg Trp Met Gly
145 150 155 160

Val Leu Asn Leu Glu Gly Lys Thr Pro Glu Thr Leu Lys Lys Thr Phe
165 170 175

Asp Ser Gln Arg Lys Arg Asn Ile Asn Lys Ala Ile Asn Tyr Gly Val
180 185 190

Lys Val Arg Phe Leu Glu Arg Asp Glu Phe Asn Leu Phe Leu Asp Leu
195 200 205

Tyr Arg Glu Thr Glu Glu Arg Ala Gly Phe Val Ser Lys Thr Asp Asp
210 215 220

Tyr Phe Tyr Asn Phe Ile Asp Thr Tyr Gly Asp Lys Val Leu Val Pro
225 230 235 240

Leu Ala Tyr Ile Asp Leu Asp Glu Tyr Val Leu Lys Leu Gln Gln Glu
245 250 255

Leu Asn Asp Lys Glu Asn Arg Arg Asp Gln Met Met Ala Lys Glu Asn
260 265 270

Lys Ser Asp Lys Gln Met Lys Lys Ile Ala Glu Leu Asp Lys Gln Ile
275 280 285

Asp His Asp Gln His Glu Leu Leu Asn Ala Ser Glu Leu Ser Lys Thr
290 295 300

Asp Gly Pro Ile Leu Asn Leu Ala Ser Gly Val Tyr Phe Ala Asn Ala
305 310 315 320

Tyr Glu Val Asn Tyr Phe Ser Gly Gly Ser Ser Glu Lys Tyr Asn Gln
325 330 335

Phe Met Gly Pro Tyr Met Met His Trp Phe Met Ile Asn Tyr Cys Phe
340 345 350

Asp Asn Gly Tyr Asp Arg Tyr Asn Phe Tyr Gly Leu Ser Gly Asp Phe
355 360 365

Thr Glu Asn Ser Glu Asp Tyr Gly Val Tyr Arg Phe Lys Arg Gly Phe
370 375 380

Asn Val Gln Ile Glu Glu Leu Ile Gly Asp Phe Tyr Lys Pro Ile His
385 390 395 400

Lys Val Lys Tyr Trp Leu Phe Thr Thr Leu Asp Lys Leu Arg Lys Lys
405 410 415

Leu Lys Lys

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 731 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p4c15/p6c63); Unknown = Xaa"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..417

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 524..730

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ATG GAA TTC ACT TAT TCG TAT TTA TTT AGA ATG ATT AGT CAT GAG ATG 48
Met Glu Phe Thr Tyr Ser Tyr Leu Phe Arg Met Ile Ser His Glu Met
1 5 10 15

AAA CAA AAG GCT GAT CAA AAG TTA GAG CAA TTT GAT ATT ACA AAT GAG 96
Lys Gln Lys Ala Asp Gln Lys Leu Glu Gln Phe Asp Ile Thr Asn Glu
20 25 30

CAA GGT CAT ACG TTA GGT TAT CTT TAT GCA CAT CAA CAA GAT GGA CTG 144
Gln Gly His Thr Leu Gly Tyr Leu Tyr Ala His Gln Gln Asp Gly Leu
35 40 45

ACA CAA AAT GAT ATT GCT AAA GCA TTA CAA CGA ACA GGT CCA ACT GTC 192
Thr Gln Asn Asp Ile Ala Lys Ala Leu Gln Arg Thr Gly Pro Thr Val
50 55 60

AGT AAT TTA TTA AGG AAC CTT GAA CGT AAA AAG CTG ATC TAT CGC TAT 240
Ser Asn Leu Leu Arg Asn Leu Glu Arg Lys Lys Leu Ile Tyr Arg Tyr
65 70 75 80

GTC GAT GCA CAA GAT ACG AGA AGA AAG AAT ATA GGG CTG ACT ACC TCT 288
Val Asp Ala Gln Asp Thr Arg Arg Lys Asn Ile Gly Leu Thr Thr Ser
85 90 95

GGG ATT AAA CTC GTA GAA GCA TTC ACT TCG ATA TTT GAT GAA ATG GAA 336
Gly Ile Lys Leu Val Glu Ala Phe Thr Ser Ile Phe Asp Glu Met Glu
100 105 110

CAA ACA CTC GTA TCG CAG TTA TCT GAA GAA GAA AAT GAA CAA ATG AAA 384
Gln Thr Leu Val Ser Gln Leu Ser Glu Glu Glu Asn Glu Gln Met Lys
115 120 125

GCA AAC TTA ACT AAA ATG TTA TCT AGT TTA CAA TAAATGATAA GTGTGACTGG 437
Ala Asn Leu Thr Lys Met Leu Ser Ser Leu Gln
130 135

TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT AGTAAGCTAA 497

TTATTGGAAA AGACAAGGAG TATTGA ACA ATG AAA GAC GAA CAA TTA TAT TAT 550
Thr Met Lys Asp Glu Gln Leu Tyr Tyr
1 5

TTT GAG AAA TCG CCA GTA TTT AAA GCG ATG ATG CAT TTC TCA TTG CCA 598
Phe Glu Lys Ser Pro Val Phe Lys Ala Met Met His Phe Ser Leu Pro
10 15 20 25

ATG ATG ATA GGG ACT TTA TTA AGC GTT ATT TAT GGC ATA TTA AAT ATT 646
Met Met Ile Gly Thr Leu Leu Ser Val Ile Tyr Gly Ile Leu Asn Ile
30 35 40

TAC TTT ATA GGA TTT TYA GAM GAY AGC CAC ATG ATT TCT GCT AAT CTC 694
Tyr Phe Ile Gly Phe Xaa Xaa Asp Ser His Met Ile Ser Ala Asn Leu
45 50 55

TCT AAC ACT GCC AGT ATT TGC TAT CTT AAT GGG GTT A 731
Ser Asn Thr Ala Ser Ile Cys Tyr Leu Asn Gly Val
60 65
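
The codons TYA, GAM and GAY in SEQ ID NO: 8 contain IUPAC ambiguity codes, and the listing translates them as Xaa, Xaa and Asp respectively. The sketch below illustrates the rule that appears to be applied: a codon is assigned a definite residue only when every base combination it allows encodes the same amino acid. It assumes the standard genetic code and uses one-letter residue codes, with "X" standing for Xaa; the function and table names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# IUPAC nucleotide codes and the bases each one can represent.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def translate_ambiguous(codon: str) -> str:
    """Translate one codon; return 'X' if its possible readings disagree."""
    choices = (IUPAC[base] for base in codon.upper())
    residues = {CODON_TABLE["".join(c)] for c in product(*choices)}
    return residues.pop() if len(residues) == 1 else "X"


for codon in ("TYA", "GAM", "GAY"):
    print(codon, translate_ambiguous(codon))
```

TYA can be read as TCA (Ser) or TTA (Leu) and GAM as GAA (Glu) or GAC (Asp), so both are reported as unknown, while GAY is GAC or GAT, both Asp. The same rule accounts for the ambiguous codons in SEQ ID NO: 13, where yTA is translated as Leu and nAA as Xaa.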

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Glu Phe Thr Tyr Ser Tyr Leu Phe Arg Met Ile Ser His Glu Met
1 5 10 15

Lys Gln Lys Ala Asp Gln Lys Leu Glu Gln Phe Asp Ile Thr Asn Glu
20 25 30

Gln Gly His Thr Leu Gly Tyr Leu Tyr Ala His Gln Gln Asp Gly Leu
35 40 45

Thr Gln Asn Asp Ile Ala Lys Ala Leu Gln Arg Thr Gly Pro Thr Val
50 55 60

Ser Asn Leu Leu Arg Asn Leu Glu Arg Lys Lys Leu Ile Tyr Arg Tyr
65 70 75 80

Val Asp Ala Gln Asp Thr Arg Arg Lys Asn Ile Gly Leu Thr Thr Ser
85 90 95

Gly Ile Lys Leu Val Glu Ala Phe Thr Ser Ile Phe Asp Glu Met Glu
100 105 110

Gln Thr Leu Val Ser Gln Leu Ser Glu Glu Glu Asn Glu Gln Met Lys
115 120 125

Ala Asn Leu Thr Lys Met Leu Ser Ser Leu Gln
130 135



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Thr Met Lys Asp Glu Gln Leu Tyr Tyr Phe Glu Lys Ser Pro Val Phe
1 5 10 15

Lys Ala Met Met His Phe Ser Leu Pro Met Met Ile Gly Thr Leu Leu
20 25 30

Ser Val Ile Tyr Gly Ile Leu Asn Ile Tyr Phe Ile Gly Phe Xaa Xaa
35 40 45

Asp Ser His Met Ile Ser Ala Asn Leu Ser Asn Thr Ala Ser Ile Cys
50 55 60

Tyr Leu Asn Gly Val
65



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (P5C4)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..825

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GTG AGG TAC TTG CCT GTG AAT GCG ATT GAA ATC CCG ACT ACC GCC GGC 48
Val Arg Tyr Leu Pro Val Asn Ala Ile Glu Ile Pro Thr Thr Ala Gly
1 5 10 15

ACG CCT GAC GCG CCC TTC TAC CAA CCG TTG GGC AAT GAA GAG CAG CTG 96
Thr Pro Asp Ala Pro Phe Tyr Gln Pro Leu Gly Asn Glu Glu Gln Leu
20 25 30

TTC CAG CAG GCC TGG CAG CAC GGC ATG CCC GTG CTT ATC AAG GGC CCG 144
Phe Gln Gln Ala Trp Gln His Gly Met Pro Val Leu Ile Lys Gly Pro
35 40 45

ACC GGC TGC GGC AAG ACC CGT TTC GTA CAG CAC ATG GCG CAT CGC CTG 192
Thr Gly Cys Gly Lys Thr Arg Phe Val Gln His Met Ala His Arg Leu
50 55 60

AAT CTG CCG CTG TAC ACC GTG GCC TGC CAT GAC GAC CTG TCG GCG GCC 240
Asn Leu Pro Leu Tyr Thr Val Ala Cys His Asp Asp Leu Ser Ala Ala
65 70 75 80

GAC CTG GTC GGC CGA CAC CTG ATC GGC GCA CAG GGC ACC TGG TGG CAG 288
Asp Leu Val Gly Arg His Leu Ile Gly Ala Gln Gly Thr Trp Trp Gln
85 90 95

GAC GGT CCG CTG ACC CGC GCG GTC CGC GAA GGA GGC ATC TGC TAC CTG 336
Asp Gly Pro Leu Thr Arg Ala Val Arg Glu Gly Gly Ile Cys Tyr Leu
100 105 110

GAC GAA GTG GTG GAA GCA CGG CAG GAC ACC GCC GTG GTA CTG CAC CCG 384
Asp Glu Val Val Glu Ala Arg Gln Asp Thr Ala Val Val Leu His Pro
115 120 125

CTG GCC GAT GAT CGC CGC GAA CTG TTC ATC GAG CGC ACC GGC GAG GCG 432
Leu Ala Asp Asp Arg Arg Glu Leu Phe Ile Glu Arg Thr Gly Glu Ala
130 135 140

CTC AAG GCG CCG CCG GGC TTC ATG CTG GTG GTG TCC TAC AAC CCC GGT 480
Leu Lys Ala Pro Pro Gly Phe Met Leu Val Val Ser Tyr Asn Pro Gly
145 150 155 160

TAC CAA AAC CTG CTC AAG GGC ATG AAG CCC AGC ACC CGC CAG CGC TTC 528
Tyr Gln Asn Leu Leu Lys Gly Met Lys Pro Ser Thr Arg Gln Arg Phe
165 170 175

GTG GCG ATG CGC TTC GAC TAT CCG CCG ACC GCC GAG GAA GAG CGC ATC 576
Val Ala Met Arg Phe Asp Tyr Pro Pro Thr Ala Glu Glu Glu Arg Ile
180 185 190

GTC GCC AAC GAG GCG CAG GTC GAT GCC GCG CTC GCC GCC CAG GTG GTC 624
Val Ala Asn Glu Ala Gln Val Asp Ala Ala Leu Ala Ala Gln Val Val
195 200 205

AAG CTT GGC CAG GCA CTG CGT CGG CTG GAA CAG CAC GAT CTG GAG GAA 672
Lys Leu Gly Gln Ala Leu Arg Arg Leu Glu Gln His Asp Leu Glu Glu
210 215 220

GTC GCC TCG ACC CGC CTG CTG ATC TTC ACC GCA CGC ATG ATC CGC TCC 720
Val Ala Ser Thr Arg Leu Leu Ile Phe Thr Ala Arg Met Ile Arg Ser
225 230 235 240

GGC ATG ACG CCG CGG CAG GCC TGC CTG GCC TGC CTC GCC GAA CCG CTG 768
Gly Met Thr Pro Arg Gln Ala Cys Leu Ala Cys Leu Ala Glu Pro Leu
245 250 255

TCG GAT GAT CCG CAG ACC GTT GCC GCG CTG ATG GAT GTG GTC TAT GTC 816
Ser Asp Asp Pro Gln Thr Val Ala Ala Leu Met Asp Val Val Tyr Val
260 265 270

CAC TTC GGC TGA 828
His Phe Gly
275

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Val Arg Tyr Leu Pro Val Asn Ala Ile Glu Ile Pro Thr Thr Ala Gly
1 5 10 15

Thr Pro Asp Ala Pro Phe Tyr Gln Pro Leu Gly Asn Glu Glu Gln Leu
20 25 30

Phe Gln Gln Ala Trp Gln His Gly Met Pro Val Leu Ile Lys Gly Pro
35 40 45

Thr Gly Cys Gly Lys Thr Arg Phe Val Gln His Met Ala His Arg Leu
50 55 60

Asn Leu Pro Leu Tyr Thr Val Ala Cys His Asp Asp Leu Ser Ala Ala
65 70 75 80

Asp Leu Val Gly Arg His Leu Ile Gly Ala Gln Gly Thr Trp Trp Gln
85 90 95

Asp Gly Pro Leu Thr Arg Ala Val Arg Glu Gly Gly Ile Cys Tyr Leu
100 105 110

Asp Glu Val Val Glu Ala Arg Gln Asp Thr Ala Val Val Leu His Pro
115 120 125

Leu Ala Asp Asp Arg Arg Glu Leu Phe Ile Glu Arg Thr Gly Glu Ala
130 135 140

Leu Lys Ala Pro Pro Gly Phe Met Leu Val Val Ser Tyr Asn Pro Gly
145 150 155 160

Tyr Gln Asn Leu Leu Lys Gly Met Lys Pro Ser Thr Arg Gln Arg Phe
165 170 175

Val Ala Met Arg Phe Asp Tyr Pro Pro Thr Ala Glu Glu Glu Arg Ile
180 185 190

Val Ala Asn Glu Ala Gln Val Asp Ala Ala Leu Ala Ala Gln Val Val
195 200 205

Lys Leu Gly Gln Ala Leu Arg Arg Leu Glu Gln His Asp Leu Glu Glu
210 215 220

Val Ala Ser Thr Arg Leu Leu Ile Phe Thr Ala Arg Met Ile Arg Ser
225 230 235 240

Gly Met Thr Pro Arg Gln Ala Cys Leu Ala Cys Leu Ala Glu Pro Leu
245 250 255

Ser Asp Asp Pro Gln Thr Val Ala Ala Leu Met Asp Val Val Tyr Val
260 265 270

His Phe Gly
275

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 528 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p9b66)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..525

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATG GAA AGG ATG TCA AAA ATG AAT ATT AAT ACA GCT TAT TTT GCC GGA 48
Met Glu Arg Met Ser Lys Met Asn Ile Asn Thr Ala Tyr Phe Ala Gly
1 5 10 15

GGT TGC TTT TGG TGT ATG ACG AAA CCA TTT GAC ACC TTT GAC GGC ATA 96
Gly Cys Phe Trp Cys Met Thr Lys Pro Phe Asp Thr Phe Asp Gly Ile
20 25 30

GAA AAA GTA ACT TCT GGA TAT ATG GGC GGA CAT ATT GAA AAT CCT ACT 144
Glu Lys Val Thr Ser Gly Tyr Met Gly Gly His Ile Glu Asn Pro Thr
35 40 45

TAC GAA CAA GTA AAA TCA GGT ACG AGT GGT CAT TTA GAA ACT GTT GAA 192
Tyr Glu Gln Val Lys Ser Gly Thr Ser Gly His Leu Glu Thr Val Glu
50 55 60

ATT CAA TAT GAT GTT GCA TTA TTC TCA TAC AAT AAG TTA TTA GAA ATA 240
Ile Gln Tyr Asp Val Ala Leu Phe Ser Tyr Asn Lys Leu Leu Glu Ile
65 70 75 80

TTT TTC TCA GTC ATT GAC CCA TTA GAT ACA GGT GGT CAA TAT CAA GAC 288
Phe Phe Ser Val Ile Asp Pro Leu Asp Thr Gly Gly Gln Tyr Gln Asp
85 90 95

CGT GGT CCT CAA TAT NAA ACA GCT ATT TTC TAC ACT AAT GAT CAT CAA 336
Arg Gly Pro Gln Tyr Xaa Thr Ala Ile Phe Tyr Thr Asn Asp His Gln
100 105 110

AAA GAA CTC GCT GAG ACT TAT ATC GAG CAG CTT AAA AAT ACG ATT AAT 384
Lys Glu Leu Ala Glu Thr Tyr Ile Glu Gln Leu Lys Asn Thr Ile Asn
115 120 125

GCT GAT AAG GCA ATT GCA ACA AAA ATA YTA CCA GCG TCA CAA TTT TAC 432
Ala Asp Lys Ala Ile Ala Thr Lys Ile Leu Pro Ala Ser Gln Phe Tyr
130 135 140

AAA GCC GAA GAC TAT CAC CAA GAT TTT TAT AAG AAA AAT CCA GAG CGC 480
Lys Ala Glu Asp Tyr His Gln Asp Phe Tyr Lys Lys Asn Pro Glu Arg
145 150 155 160

TAT GCA GAA GAA CAA AAA ATA CGC CAA GAA TAC AAA AAT AAG CAA 525
Tyr Ala Glu Glu Gln Lys Ile Arg Gln Glu Tyr Lys Asn Lys Gln
165 170 175

TAA 528



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 5 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Glu Arg Met Ser Lys Met Asn Ile Asn Thr Ala Tyr Phe Ala Gly
1 5 10 15

Gly Cys Phe Trp Cys Met Thr Lys Pro Phe Asp Thr Phe Asp Gly Ile
20 25 30

Glu Lys Val Thr Ser Gly Tyr Met Gly Gly His Ile Glu Asn Pro Thr
35 40 45

Tyr Glu Gln Val Lys Ser Gly Thr Ser Gly His Leu Glu Thr Val Glu
50 55 60

Ile Gln Tyr Asp Val Ala Leu Phe Ser Tyr Asn Lys Leu Leu Glu Ile
65 70 75 80

Phe Phe Ser Val Ile Asp Pro Leu Asp Thr Gly Gly Gln Tyr Gln Asp
85 90 95

Arg Gly Pro Gln Tyr Xaa Thr Ala Ile Phe Tyr Thr Asn Asp His Gln
100 105 110

Lys Glu Leu Ala Glu Thr Tyr Ile Glu Gln Leu Lys Asn Thr Ile Asn
115 120 125

Ala Asp Lys Ala Ile Ala Thr Lys Ile Leu Pro Ala Ser Gln Phe Tyr
130 135 140

Lys Ala Glu Asp Tyr His Gln Asp Phe Tyr Lys Lys Asn Pro Glu Arg
145 150 155 160

Tyr Ala Glu Glu Gln Lys Ile Arg Gln Glu Tyr Lys Asn Lys Gln
165 170 175

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA (genomic) (p7c26)"

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..699



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATG ATT GAG TTA AAA CAT GTG ACT TTT GGT TAT AAT AAA AAG CAG ATG 48
Met Ile Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys Gln Met
1 5 10 15

GTG CTA CAA GAT ATC AAT ATT ACT ATA CCT GAT GGA GAA AAT GTT GGT 96
Val Leu Gln Asp Ile Asn Ile Thr Ile Pro Asp Gly Glu Asn Val Gly
20 25 30

ATT TTA GGC GAA AGT GGC TGT GGT AAA AGT ACG CTC GCT TCA TTG GTT 144
Ile Leu Gly Glu Ser Gly Cys Gly Lys Ser Thr Leu Ala Ser Leu Val
35 40 45

CTT GGC TTG TTT AAA CCT GTT AAA GGA GAG ATT TAC TTA AGT GAC AAT 192
Leu Gly Leu Phe Lys Pro Val Lys Gly Glu Ile Tyr Leu Ser Asp Asn
50 55 60

GCT GTG TTA CCG ATT TTC CAA CAC CCT TTA ACT AGC TTT AAC CCT GAT 240
Ala Val Leu Pro Ile Phe Gln His Pro Leu Thr Ser Phe Asn Pro Asp
65 70 75 80

TGG ACG ATT GAG ACC TCA TTA AAA GAA GCG TTA TAT TAT TAC AGA GGT 288
Trp Thr Ile Glu Thr Ser Leu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly
85 90 95

CTA ACT GAT AAT ACT GCT CAG GAT CAA TTA TTA TTA CAA CAT TTA TCT 336
Leu Thr Asp Asn Thr Ala Gln Asp Gln Leu Leu Leu Gln His Leu Ser
100 105 110

ACT TTT GAG TTA AAC GCG CAA TTA TTG ACT AAA TTA CCA AGC GAA GTG 384
Thr Phe Glu Leu Asn Ala Gln Leu Leu Thr Lys Leu Pro Ser Glu Val
115 120 125

AGT GGC GGA CAA TTA CAA AGA TTT AAT GTC ATG CGT TCG TTA TTA GCA 432
Ser Gly Gly Gln Leu Gln Arg Phe Asn Val Met Arg Ser Leu Leu Ala
130 135 140

CAG CCT CGC GTT TTA ATA TGT GAT GAG ATA ACT TCA AAT TTA GAT GTT 480
Gln Pro Arg Val Leu Ile Cys Asp Glu Ile Thr Ser Asn Leu Asp Val
145 150 155 160

ATA GCT GAA CAA AAT GTA ATC AAT ATA TTA AAA GCG CAA ACG ATT ACG 528
Ile Ala Glu Gln Asn Val Ile Asn Ile Leu Lys Ala Gln Thr Ile Thr
165 170 175

AAC TTA AAT CAT TTT ATC GTT ATT TCT CAT GAT TTA TCC GTG TTA CAA 576
Asn Leu Asn His Phe Ile Val Ile Ser His Asp Leu Ser Val Leu Gln
180 185 190

CGC TTA GTT AAT AGA ATT ATC GTT CTT AAG GAT GGC ATG ATA GTC GAT 624
Arg Leu Val Asn Arg Ile Ile Val Leu Lys Asp Gly Met Ile Val Asp
195 200 205

GAT TTT GCA ATA GAG GAA TTA TTT AAT GTT GAT AGA CAC CCT TAT ACA 672
Asp Phe Ala Ile Glu Glu Leu Phe Asn Val Asp Arg His Pro Tyr Thr
210 215 220

AAA GAA TTA GTG CAA GCA TTT TCA TAT TAG 702
Lys Glu Leu Val Gln Ala Phe Ser Tyr
225 230



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ile Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys Gln Met
1 5 10 15

Val Leu Gln Asp Ile Asn Ile Thr Ile Pro Asp Gly Glu Asn Val Gly
20 25 30

Ile Leu Gly Glu Ser Gly Cys Gly Lys Ser Thr Leu Ala Ser Leu Val
35 40 45

Leu Gly Leu Phe Lys Pro Val Lys Gly Glu Ile Tyr Leu Ser Asp Asn
50 55 60

Ala Val Leu Pro Ile Phe Gln His Pro Leu Thr Ser Phe Asn Pro Asp
65 70 75 80

Trp Thr Ile Glu Thr Ser Leu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly
85 90 95

Leu Thr Asp Asn Thr Ala Gln Asp Gln Leu Leu Leu Gln His Leu Ser
100 105 110

Thr Phe Glu Leu Asn Ala Gln Leu Leu Thr Lys Leu Pro Ser Glu Val
115 120 125

Ser Gly Gly Gln Leu Gln Arg Phe Asn Val Met Arg Ser Leu Leu Ala
130 135 140

Gln Pro Arg Val Leu Ile Cys Asp Glu Ile Thr Ser Asn Leu Asp Val
145 150 155 160

Ile Ala Glu Gln Asn Val Ile Asn Ile Leu Lys Ala Gln Thr Ile Thr
165 170 175

Asn Leu Asn His Phe Ile Val Ile Ser His Asp Leu Ser Val Leu Gln
180 185 190

Arg Leu Val Asn Arg Ile Ile Val Leu Lys Asp Gly Met Ile Val Asp
195 200 205

Asp Phe Ala Ile Glu Glu Leu Phe Asn Val Asp Arg His Pro Tyr Thr
210 215 220

Lys Glu Leu Val Gln Ala Phe Ser Tyr
225 230



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA (genomic) (p10c15)"

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..774



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG ATG AGT CTC ATT GAT ATA CAA AAT TTA ACA ATA AAG AAT ACT AGT 48
Met Met Ser Leu Ile Asp Ile Gln Asn Leu Thr Ile Lys Asn Thr Ser
1 5 10 15

GAG AAA TCT CTT ATT AAA GGG ATT GAT TTG AAA ATT TTT AGT CAA CAG 96
Glu Lys Ser Leu Ile Lys Gly Ile Asp Leu Lys Ile Phe Ser Gln Gln
20 25 30

ATT AAT GCC TTG ATT GGA GAG AGC GGC GCT GGA AAA AGT TTG ATT GCT 144
Ile Asn Ala Leu Ile Gly Glu Ser Gly Ala Gly Lys Ser Leu Ile Ala
35 40 45

AAA GCT TTA CTT GAA TAT TTA CCA TTT GAT TTA AGC TGC ACG TAT GAT 192
Lys Ala Leu Leu Glu Tyr Leu Pro Phe Asp Leu Ser Cys Thr Tyr Asp
50 55 60

TCG TAC CAA TTT GAT GGG GAA AAT GTT AGT AGA TTG AGT CAA TAT TAT 240
Ser Tyr Gln Phe Asp Gly Glu Asn Val Ser Arg Leu Ser Gln Tyr Tyr
65 70 75 80

GGT CAT ACA ATT GGC TAT ATT TCT CAA AAT TAT GCA GAA AGT TTT AAC 288
Gly His Thr Ile Gly Tyr Ile Ser Gln Asn Tyr Ala Glu Ser Phe Asn
85 90 95

GAC CAT ACT AAA TTA GGT AAA CAG TTA ACT GCG ATT TAT CGT AAG CAT 336
Asp His Thr Lys Leu Gly Lys Gln Leu Thr Ala Ile Tyr Arg Lys His
100 105 110

TAT AAA GGT AGT AAA GAA GAG GCT TTG TCC AAA GTT GAT AAG GCT TTG 384
Tyr Lys Gly Ser Lys Glu Glu Ala Leu Ser Lys Val Asp Lys Ala Leu
115 120 125

TCG TGG GTT AAT TTA CAA AGC AAA GAT ATA TTA AAT AAA TAT AGT TTC 432
Ser Trp Val Asn Leu Gln Ser Lys Asp Ile Leu Asn Lys Tyr Ser Phe
130 135 140

CAA CTT TCT GGG GGC CAA CTT GAA CGC GTA TAC ATA GCA AGC GTT CTC 480
Gln Leu Ser Gly Gly Gln Leu Glu Arg Val Tyr Ile Ala Ser Val Leu
145 150 155 160

ATG TTG GAG CCT AAA TTA ATC ATT GCA GAC GAA CCA GTT GCA TCA TTG 528
Met Leu Glu Pro Lys Leu Ile Ile Ala Asp Glu Pro Val Ala Ser Leu
165 170 175

GAT GCT TTG AAC GGT AAT CAA GTG ATG GAT TTA TTA CAG CAT ATT GTA 576
Asp Ala Leu Asn Gly Asn Gln Val Met Asp Leu Leu Gln His Ile Val
180 185 190

TTA GAA CAT GGT CAA ACA TTA TTT ATT ATC ACA CAT AAC TTA AGT CAT 624
Leu Glu His Gly Gln Thr Leu Phe Ile Ile Thr His Asn Leu Ser His
195 200 205

GTA TTG AAA TAT TGT CAG TAC ATT TAT GTT TTA AAA GAA GGT CAA ATC 672
Val Leu Lys Tyr Cys Gln Tyr Ile Tyr Val Leu Lys Glu Gly Gln Ile
210 215 220

ATT GAA CGA GGT AAT ATT AAT CAT TTC AAG TAT GAG CAT TTG CAT CCG 720
Ile Glu Arg Gly Asn Ile Asn His Phe Lys Tyr Glu His Leu His Pro
225 230 235 240

TAT ACT GAA CGT CTA ATT AAA TAT AGA ACA CAA TTA AAG AGG GAT TAC 768
Tyr Thr Glu Arg Leu Ile Lys Tyr Arg Thr Gln Leu Lys Arg Asp Tyr
245 250 255

TAT GAT TGAGTTAAAA CATGTGACTT TTGGTTATAA TAAAAAGCAG ATGGTGCTAC 824
Tyr Asp



AAGATATCAA TATTACTATA CCTGATGGAG AAAATGTTGG TATTTTAGGC GAAAGTGGCT 884

GTGGTAAAAG TACGCTCGCT TCATTGGTTC TTGGCTTGTT TAAACCTGTT AAAGGAGAGA 944

TTTACTTAAG TGACAATGCT GTGTTACCGA TTTTCCAACA CCCTTTAACT AGCTTTAACC 1004

CTGATTGGAC GATTGAGACC TCATTAAAAG AAGCGTTATA TTATTACAGA GGTCTAACTG 1064

ATAATACTGC TCAGGATCAA TTATTATTAC AACATTTATC TACTTTTGAG TTAAACGCGC 1124

AATTATTGAC TAAATTACCA AGCGAAGTGA GTGGCGGACA ATTACAAAGA TTTAATGTCA 1184

TGCGTTCGTT ATTAGCACAG CCTCGCGTTT TAATATGTGA TGAGATAACT TCAAATTTAG 1244

ATGTTATAGC TGAACAAAAT GTAATCAATA TATTAAAAGC GCAAACGATT ACGAACTTAA 1304

ATCATTTTAT CGTTATTTCT CATGATTTAT CCGTGTTACA ACGCTTAGTT AATAGAATTA 1364

TCGTTCTTAA GGATGGCATG ATAGTCGATG ATTTTGCAAT AGAGGAATTA TTTAATGTTG 1424

ATAGACACCC TTATACAAAA GAATTAGTGC AAGCATTTTC ATATTAG 1471



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 258 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Met Ser Leu Ile Asp Ile Gln Asn Leu Thr Ile Lys Asn Thr Ser
1 5 10 15

Glu Lys Ser Leu Ile Lys Gly Ile Asp Leu Lys Ile Phe Ser Gln Gln
20 25 30

Ile Asn Ala Leu Ile Gly Glu Ser Gly Ala Gly Lys Ser Leu Ile Ala
35 40 45

Lys Ala Leu Leu Glu Tyr Leu Pro Phe Asp Leu Ser Cys Thr Tyr Asp
50 55 60

Ser Tyr Gln Phe Asp Gly Glu Asn Val Ser Arg Leu Ser Gln Tyr Tyr
65 70 75 80

Gly His Thr Ile Gly Tyr Ile Ser Gln Asn Tyr Ala Glu Ser Phe Asn
85 90 95

Asp His Thr Lys Leu Gly Lys Gln Leu Thr Ala Ile Tyr Arg Lys His
100 105 110

Tyr Lys Gly Ser Lys Glu Glu Ala Leu Ser Lys Val Asp Lys Ala Leu
115 120 125

Ser Trp Val Asn Leu Gln Ser Lys Asp Ile Leu Asn Lys Tyr Ser Phe
130 135 140

Gln Leu Ser Gly Gly Gln Leu Glu Arg Val Tyr Ile Ala Ser Val Leu
145 150 155 160

Met Leu Glu Pro Lys Leu Ile Ile Ala Asp Glu Pro Val Ala Ser Leu
165 170 175

Asp Ala Leu Asn Gly Asn Gln Val Met Asp Leu Leu Gln His Ile Val
180 185 190

Leu Glu His Gly Gln Thr Leu Phe Ile Ile Thr His Asn Leu Ser His
195 200 205

Val Leu Lys Tyr Cys Gln Tyr Ile Tyr Val Leu Lys Glu Gly Gln Ile
210 215 220

Ile Glu Arg Gly Asn Ile Asn His Phe Lys Tyr Glu His Leu His Pro
225 230 235 240

Tyr Thr Glu Arg Leu Ile Lys Tyr Arg Thr Gln Leu Lys Arg Asp Tyr
245 250 255

Tyr Asp

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1471 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA (genomic) (p10c15)"

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 770..1468



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 



ATGATGAGTC TCATTGATAT ACAAAATTTA ACAATAAAGA ATACTAGTGA GAAATCTCTT 60

ATTAAAGGGA TTGATTTGAA AATTTTTAGT CAACAGATTA ATGCCTTGAT TGGAGAGAGC 120

GGCGCTGGAA AAAGTTTGAT TGCTAAAGCT TTACTTGAAT ATTTACCATT TGATTTAAGC 180

TGCACGTATG ATTCGTACCA ATTTGATGGG GAAAATGTTA GTAGATTGAG TCAATATTAT 240

GGTCATACAA TTGGCTATAT TTCTCAAAAT TATGCAGAAA GTTTTAACGA CCATACTAAA 300

TTAGGTAAAC AGTTAACTGC GATTTATCGT AAGCATTATA AAGGTAGTAA AGAAGAGGCT 360

TTGTCCAAAG TTGATAAGGC TTTGTCGTGG GTTAATTTAC AAAGCAAAGA TATATTAAAT 420

AAATATAGTT TCCAACTTTC TGGGGGCCAA CTTGAACGCG TATACATAGC AAGCGTTCTC 480

ATGTTGGAGC CTAAATTAAT CATTGCAGAC GAACCAGTTG CATCATTGGA TGCTTTGAAC 540

GGTAATCAAG TGATGGATTT ATTACAGCAT ATTGTATTAG AACATGGTCA AACATTATTT 600

ATTATCACAC ATAACTTAAG TCATGTATTG AAATATTGTC AGTACATTTA TGTTTTAAAA 660

GAAGGTCAAA TCATTGAACG AGGTAATATT AATCATTTCA AGTATGAGCA TTTGCATCCG 720

TATACTGAAC GTCTAATTAA ATATAGAACA CAATTAAAGA GGGATTACT ATG ATT 775
Met Ile
1

GAG TTA AAA CAT GTG ACT TTT GGT TAT AAT AAA AAG CAG ATG GTG CTA 823
Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys Gln Met Val Leu
5 10 15

CAA GAT ATC AAT ATT ACT ATA CCT GAT GGA GAA AAT GTT GGT ATT TTA 871
Gln Asp Ile Asn Ile Thr Ile Pro Asp Gly Glu Asn Val Gly Ile Leu
20 25 30

GGC GAA AGT GGC TGT GGT AAA AGT ACG CTC GCT TCA TTG GTT CTT GGC 919
Gly Glu Ser Gly Cys Gly Lys Ser Thr Leu Ala Ser Leu Val Leu Gly
35 40 45 50

TTG TTT AAA CCT GTT AAA GGA GAG ATT TAC TTA AGT GAC AAT GCT GTG 967
Leu Phe Lys Pro Val Lys Gly Glu Ile Tyr Leu Ser Asp Asn Ala Val
55 60 65

TTA CCG ATT TTC CAA CAC CCT TTA ACT AGC TTT AAC CCT GAT TGG ACG 1015
Leu Pro Ile Phe Gln His Pro Leu Thr Ser Phe Asn Pro Asp Trp Thr
70 75 80

ATT GAG ACC TCA TTA AAA GAA GCG TTA TAT TAT TAC AGA GGT CTA ACT 1063
Ile Glu Thr Ser Leu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly Leu Thr
85 90 95

GAT AAT ACT GCT CAG GAT CAA TTA TTA TTA CAA CAT TTA TCT ACT TTT 1111
Asp Asn Thr Ala Gln Asp Gln Leu Leu Leu Gln His Leu Ser Thr Phe
100 105 110

GAG TTA AAC GCG CAA TTA TTG ACT AAA TTA CCA AGC GAA GTG AGT GGC 1159
Glu Leu Asn Ala Gln Leu Leu Thr Lys Leu Pro Ser Glu Val Ser Gly
115 120 125 130

GGA CAA TTA CAA AGA TTT AAT GTC ATG CGT TCG TTA TTA GCA CAG CCT 1207
Gly Gln Leu Gln Arg Phe Asn Val Met Arg Ser Leu Leu Ala Gln Pro
135 140 145

CGC GTT TTA ATA TGT GAT GAG ATA ACT TCA AAT TTA GAT GTT ATA GCT 1255
Arg Val Leu Ile Cys Asp Glu Ile Thr Ser Asn Leu Asp Val Ile Ala
150 155 160

GAA CAA AAT GTA ATC AAT ATA TTA AAA GCG CAA ACG ATT ACG AAC TTA 1303
Glu Gln Asn Val Ile Asn Ile Leu Lys Ala Gln Thr Ile Thr Asn Leu
165 170 175

AAT CAT TTT ATC GTT ATT TCT CAT GAT TTA TCC GTG TTA CAA CGC TTA 1351
Asn His Phe Ile Val Ile Ser His Asp Leu Ser Val Leu Gln Arg Leu
180 185 190

GTT AAT AGA ATT ATC GTT CTT AAG GAT GGC ATG ATA GTC GAT GAT TTT 1399
Val Asn Arg Ile Ile Val Leu Lys Asp Gly Met Ile Val Asp Asp Phe
195 200 205 210

GCA ATA GAG GAA TTA TTT AAT GTT GAT AGA CAC CCT TAT ACA AAA GAA 1447
Ala Ile Glu Glu Leu Phe Asn Val Asp Arg His Pro Tyr Thr Lys Glu
215 220 225

TTA GTG CAA GCA TTT TCA TAT TAG 1471
Leu Val Gln Ala Phe Ser Tyr
230

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 233 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Ile Glu Leu Lys His Val Thr Phe Gly Tyr Asn Lys Lys Gln Met
1 5 10 15

Val Leu Gln Asp Ile Asn Ile Thr Ile Pro Asp Gly Glu Asn Val Gly
20 25 30

Ile Leu Gly Glu Ser Gly Cys Gly Lys Ser Thr Leu Ala Ser Leu Val
35 40 45

Leu Gly Leu Phe Lys Pro Val Lys Gly Glu Ile Tyr Leu Ser Asp Asn
50 55 60

Ala Val Leu Pro Ile Phe Gln His Pro Leu Thr Ser Phe Asn Pro Asp
65 70 75 80

Trp Thr Ile Glu Thr Ser Leu Lys Glu Ala Leu Tyr Tyr Tyr Arg Gly
85 90 95

Leu Thr Asp Asn Thr Ala Gln Asp Gln Leu Leu Leu Gln His Leu Ser
100 105 110

Thr Phe Glu Leu Asn Ala Gln Leu Leu Thr Lys Leu Pro Ser Glu Val
115 120 125

Ser Gly Gly Gln Leu Gln Arg Phe Asn Val Met Arg Ser Leu Leu Ala
130 135 140

Gln Pro Arg Val Leu Ile Cys Asp Glu Ile Thr Ser Asn Leu Asp Val
145 150 155 160

Ile Ala Glu Gln Asn Val Ile Asn Ile Leu Lys Ala Gln Thr Ile Thr
165 170 175

Asn Leu Asn His Phe Ile Val Ile Ser His Asp Leu Ser Val Leu Gln
180 185 190

Arg Leu Val Asn Arg Ile Ile Val Leu Lys Asp Gly Met Ile Val Asp
195 200 205

Asp Phe Ala Ile Glu Glu Leu Phe Asn Val Asp Arg His Pro Tyr Thr
210 215 220

Lys Glu Leu Val Gln Ala Phe Ser Tyr
225 230

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA (genomic) (p13b74)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:



AATCATAAAT TACGAATTAG ATATAACAAA AAAGAGCTTG GGACATTAAG TCCTTAAAGT 60

CTTAGGCAAT GTAAAAAAGC TGATTTCTAT TATTTATTTG ATAGAAATCA GCTTTTTTGA 120

TATGTATTTT ATAATGTACA GCTCGTTGAG CTGCTATTTT CCTTATATTA AGTGCCATTA 180

ATACAAAACC TAGCTCTCGT TTAACTTTAT TTATTCCCGA ACTGACATTC GAGTGAAACC 240

CAAAATAGCC TTCTAATCCA AAAACAGGCT CTACTCAATT TTCCTTTTGA CTAATAGATT 300

TTTTCGTTTC TGGTTCCAAA GCCTTGAGGT TGTAGAATTC TTGAAGACGA AAAGGGCCCC 360

GTGATACGCC CTATTTTTAA AGGTTAATGT TCTGAA 396



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA (genomic) (p14c15)"

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 3..788

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 856..1842

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GC ATC AAT GAA CAT CAG GAA TTA ACA ACT TTA GGC AGA GGT GGT TCT 47
Ile Asn Glu His Gln Glu Leu Thr Thr Leu Gly Arg Gly Gly Ser
1 5 10 15

GAT ACG ACC GCT GTG GCA CTT GCT GTT AGT AAT CAA ATA CCT TGT GAA 95
Asp Thr Thr Ala Val Ala Leu Ala Val Ser Asn Gln Ile Pro Cys Glu
20 25 30

ATT TAT ACC GAC GTT GAT GGT GTG TAT GCC ACT GAC CCA AGA CTT TTA 143
Ile Tyr Thr Asp Val Asp Gly Val Tyr Ala Thr Asp Pro Arg Leu Leu
35 40 45

CCA AAG GCT AAA CGA CTA GAC ATC GTC TCA TAT GAA GAA ATG ATG GAA 191
Pro Lys Ala Lys Arg Leu Asp Ile Val Ser Tyr Glu Glu Met Met Glu
50 55 60

ATG AGC GCT TTA GGT GCT GGT GTA CTT GAA ACA AGA AGT GTT GAA TTA 239
Met Ser Ala Leu Gly Ala Gly Val Leu Glu Thr Arg Ser Val Glu Leu
65 70 75

GCT AAA AAC TAT AAT ATC CCT TTA TAT TTA GGA AAA ACT TTA TCG AAC 287
Ala Lys Asn Tyr Asn Ile Pro Leu Tyr Leu Gly Lys Thr Leu Ser Asn
80 85 90 95

GTG AAA GGA ACA TGG ATT ATG TCA AAT GAA GAA ATA TTA GAG AAA AAA 335
Val Lys Gly Thr Trp Ile Met Ser Asn Glu Glu Ile Leu Glu Lys Lys
100 105 110

GCA GTT ACT GGT GTG GCT TTG GAT AAA CAT ATG ATG CAT GTA ACA ATT 383
Ala Val Thr Gly Val Ala Leu Asp Lys His Met Met His Val Thr Ile
115 120 125

AGT TAT CCC CTA CCT GAC AAT CAG CTA CTT ACC CAA CTA TTT ACG GAA 431
Ser Tyr Pro Leu Pro Asp Asn Gln Leu Leu Thr Gln Leu Phe Thr Glu
130 135 140

CTT GAA GAA GGT GCT GTA AAT GTT GAT ATG ATT TCA CAA ATC GTC AAC 479
Leu Glu Glu Gly Ala Val Asn Val Asp Met Ile Ser Gln Ile Val Asn
145 150 155

TTG GAT GGG CTA CAA CTA TCC TTC ACG ATT AAA GAT AGT GAT TTT CAT 527
Leu Asp Gly Leu Gln Leu Ser Phe Thr Ile Lys Asp Ser Asp Phe His
160 165 170 175

CAA ATT TCT ATG ATT CTT GAA ACA TTA AAG AAT CAA TAT GAA GCA TTA 575
Gln Ile Ser Met Ile Leu Glu Thr Leu Lys Asn Gln Tyr Glu Ala Leu
180 185 190

GCT TAT AAA ATC AAT GAG CAT TAT GTC AAA ATT TCA TTA ATT GGC TCA 623
Ala Tyr Lys Ile Asn Glu His Tyr Val Lys Ile Ser Leu Ile Gly Ser
195 200 205

GGC ATG CGT GAT ATG TCA GGT GTG GCA TCA AAA GCA TTT TTG ACA TTA 671
Gly Met Arg Asp Met Ser Gly Val Ala Ser Lys Ala Phe Leu Thr Leu
210 215 220

ATT GAA AAT AAT ATA CCT TTC TAC CAA ACA ACA ACA TCT GAA ATA AGT 719
Ile Glu Asn Asn Ile Pro Phe Tyr Gln Thr Thr Thr Ser Glu Ile Ser
225 230 235

ATT TCA TAC GTC ATT GAT GAT TTT AAT GGG CAA CAA GCG GTA GAA AAA 767
Ile Ser Tyr Val Ile Asp Asp Phe Asn Gly Gln Gln Ala Val Glu Lys
240 245 250 255

CTA TAT GAC GCA TTT AAC ATT TAATGGTAAA ATGATTGTTA AAATATTCTA 818
Leu Tyr Asp Ala Phe Asn Ile
260

AAAATTGGGA AATTATTATA AAATGGAGTG ACAAGTT ATG ACA AAG TTA GCA GTT 873
Met Thr Lys Leu Ala Val
1 5

GTG GGT GCA ACA GGA TTA GTA GGT ACA AAA ATG TTG GAG ACA TTA AAT 921
Val Gly Ala Thr Gly Leu Val Gly Thr Lys Met Leu Glu Thr Leu Asn
10 15 20

CGT AAA AAT ATT CCT TTC GAT GAA TTA GTA TTA TTT TCA TCA GCA CGT 969
Arg Lys Asn Ile Pro Phe Asp Glu Leu Val Leu Phe Ser Ser Ala Arg
25 30 35
TCT
Ser 


GCA 
Ala 
40 


GGG 
Gly 


CAA 
Gin 


GAA 
Glu 


PTT 

Val 


PAA 

Glu 
45 


TTT 
Phe 


PA A 

CAA 

Gin 


PP A 

GGA 

Gly 


AAA 
Lys 


ACA 
Thr 
50 


TAT 
Tyr 


ACA 
Thr 


GTT 
Val 


CAA 
Gin 


1017 


GAA 
Glu 
55 


TTA 
Leu 


ACT 
Thr 


GAT 
Asp 


PPT 

Ala 


PPT 
Li X 

Arg 
60 


Ala 


AC X 

Ser 


PA A 

GAA 

Glu 


CAT 
His 


TTC 
Phe 
65 


GAT 
Asp 


TAT 
Tyr 


GTA 
Val 


TTA 
Leu 


ATG 
Met 
70 


1065 


AGT 
Ser 


GCT 
Ala 


PPT 

Gly 


ppp 
Gly 


CZC2T 

Gly 
75 


apa 
Thr 


Ser 


pa t\ 
CAA 

Glu 


PA P 

CAC 
His 


TTT 
Phe 
80 


GCC 
Ala 


CCA 
Pro 


CTT 
Leu 


TTT 
Phe 


GAA 
Glu 
85 


AAA 
Lys 


1113 


GCT 
Ala 


GGT 
Gly 


ppa 
Ala 


ATP 

A.X C 

He 
90 


PTT 
ui X 

val 


ata 
lie 


PAP 

Asp 


AA1 

Asn 


TCA 
Ser 
95 


AGT 
Ser 


CAA 
Gin 


TGG 
Trp 


CGT 
Arg 


ATG 
Met 
100 


GCA 
Ala 


GAA 
Glu 


1161 


GAT 
Asp 


ATT 

lie 


GAT 
Asp 
105 


TTA 
Leu 


ATC 
He 


PTT 

Val 


ppp 

CCC 

Pro 


PA A 
CAA 

Glu 
110 


PTP 

Val 


AAT 

AA1 

Asn 


P A A 

CAA 

Glu 


CCT 
Pro 


A PA 

ACA 

Thr 
115 


TTT 
Phe 


ACA 
Thr 


AGA 
Arg 


1209 


GGT 
Gly 


ATP 

He 
120 


ATT 

He 


ppp 
Ala 


AT 
AAX 

Asn 


pp a 

CCA 

Pro 


A A P 

AAC 

Asn 
125 


TGC 
Cys 


TCT 
Ser 


ACG 
Thr 


ATT 
He 


CAA 
Gin 
130 


TCT 

Ser 


GTT 
Val 


GTA 
Val 


CCT 
Pro 


1257 


CTA 
Leu 
135 


AAA 
Lys 


PTA 

Val 


TTP 

Leu 


paa 
Gin 


PAT 

C A x 

Asp 
140 


GCT 
Ala 


TAT 
Tyr 


GGT 
Gly 


TTA 
Leu 


AAA 
Lys 
145 


CGA 
Arg 


GTG 
val 


GCA 
Ala 


TAT 
Tyr 


ACA 
Thr 
150 


1305 


ACA 
Thr 


TAT 
Tyr 


CAA 
Gin 


PPT 

O v_ X 

Ala 


PTA 

Val 
155 


X Lrt 

Ser 


PPT" 
CC X 

Gly 


TP A 
1 LA 

Ser 


CCC 

Gly 


ATG 
Met 
160 


AAA 

AAA 

Lys 


GGT 
Gly 


A A P 

AAC 
Lys 


AAA 

AAA 

Lys 


GAT 
Asp 
165 


TTA 
Leu 


1353 


GCT 
Ala 


paa 
Glu 


PPT 

Gly 


pta 
Val 
170 


a at 
Asn 


PPT 

Gly 


a a a 
AAA 

Lys 


PP A 

CCA 

Ala 


PP A 

CCA 

Pro 
175 


P A A 

CAA 

Glu 


PP A 

CCA 

Ala 


IA1 

Tyr 


PP A 

CCA 

Pro 


CAT 
His 
180 


CCA 
Pro 


ATT 
He 


1401 


TAT 
Tyr 


AAT 
Asn 


AAT 
Asn 
185 


PTP 

Val 


TTA 
Leu 


ppp 
Pro 


PAT 
CA 1 

His 


ATT 
AX I 

He 
190 


PAT 

Asp 


PTP 

bib 

Val 


TTT 
Phe 


TTA 

1 1A 

Leu 


PA A 

CAA 

Glu 
195 


AAC 
Asn 


GGA 
Gly 


TAT 
Tyr 


144 9 


ACA 
Thr 


AAA 
Lys 
200 


GAA 
Glu 


GAA 
Glu 


CAA 
Gin 


AAA 
Lys 


ATG 
Met 
205 


ATT 
He 


GAT 
Asp 


GAG 
Glu 


ACG 
Thr 


AGA 
Arg 
210 


AAA 
Lys 


ATT 
He 


TTA 
Leu 


AAT 
Asn 


1497 


GCG 
Ala 
215 


CCA 
Pro 


GAC 
Asp 


TTA 

Leu 


AAA 
Lys 


GTA 

Val 
220 


ACA 
Thr 


GCA 
Ala 


ACA 
Thr 


TGC 
Cys 


GCA 
Ala 
225 


CGT 
Arg 


GTG 
Val 


CCT 
Pro 


GTT 
Val 


CAA 
Gin 
230 


154 5 


GAT 
Asp 

PPA 

Ala 


AGT 
Ser 

VarvrV 

Glu 


CAT 
His 

pat 
Asp 


AGT 
Ser 

ATT 
He 
250 


GTT 
Val 
235 

A A A 
AAA 

Lys 


GAA 
Glu 

ppp 
Ala 


ATT 
He 

TTA 
1 1A 

Leu 


GAT 
Asp 

TTT 
Phe 


GTA 
Val 

PAT 

bAl 

Asp 
255 


ACG 
Thr 
240 

PA A 

CAA 

Gin 


CTT 
Leu 

PAT 

CAx 

Asp 


GAC 
Asp 

PAP 

CAC 

Asp 


AAA 
Lys 

CGC 
Arg 


GAA 
Glu 

GTT 
Val 
260 


ACA 
Thr 
245 
GTT 
Val 


ACA 
Thr 

TTA 
Leu 


1593 
1641 


GTA 
Val 


GAC 
Asp 


AAT 
Asn 
265 


CCA 
Pro 


GAG 
Glu 


AAC 
Asn 


AAT 

Asn 


GAA 
Glu 
270 


TAT 
Tyr 


CCA 
Pro 


ATG 
Met 


GCA 
Ala 


ATC 
He 
275 


AAT 
Asn 


TCT 
Ser 


ACT 
Thr 


1689 


AAT 
Asn 


AAA 
Lys 
280 


GAT 
Asp 


GAA 
Glu 


GTG 
Val 


TTT 
Phe 


GTT 
Val 
285 


GGC 
Gly 


CGT 
Arg 


ATA 
He 


CGT 
Arg 


AGA 
Arg 
290 


GAT 
Asp 


GAT 
Asp 


TCA 
Ser 


TTA 
Leu 


1737 


GAA AAT ACT TTC CAT GTA TGG TGT ACA TCA GAC AAT TTA TTA AAA GGT 1785
Glu Asn Thr Phe His Val Trp Cys Thr Ser Asp Asn Leu Leu Lys Gly
295 300 305 310

GCT GCA TTA AAT GCT GTA CAA GTA TTG GAA CAA GTT ATG CGT TTA AAA 1833
Ala Ala Leu Asn Ala Val Gln Val Leu Glu Gln Val Met Arg Leu Lys
315 320 325

GGA GCG AAT TAA 1845
Gly Ala Asn



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ile Asn Glu His Gln Glu Leu Thr Thr Leu Gly Arg Gly Gly Ser Asp
1 5 10 15

Thr Thr Ala Val Ala Leu Ala Val Ser Asn Gln Ile Pro Cys Glu Ile
20 25 30

Tyr Thr Asp Val Asp Gly Val Tyr Ala Thr Asp Pro Arg Leu Leu Pro
35 40 45

Lys Ala Lys Arg Leu Asp Ile Val Ser Tyr Glu Glu Met Met Glu Met
50 55 60

Ser Ala Leu Gly Ala Gly Val Leu Glu Thr Arg Ser Val Glu Leu Ala
65 70 75 80

Lys Asn Tyr Asn Ile Pro Leu Tyr Leu Gly Lys Thr Leu Ser Asn Val
85 90 95

Lys Gly Thr Trp Ile Met Ser Asn Glu Glu Ile Leu Glu Lys Lys Ala
100 105 110

Val Thr Gly Val Ala Leu Asp Lys His Met Met His Val Thr Ile Ser
115 120 125

Tyr Pro Leu Pro Asp Asn Gln Leu Leu Thr Gln Leu Phe Thr Glu Leu
130 135 140

Glu Glu Gly Ala Val Asn Val Asp Met Ile Ser Gln Ile Val Asn Leu
145 150 155 160

Asp Gly Leu Gln Leu Ser Phe Thr Ile Lys Asp Ser Asp Phe His Gln
165 170 175

Ile Ser Met Ile Leu Glu Thr Leu Lys Asn Gln Tyr Glu Ala Leu Ala
180 185 190

Tyr Lys Ile Asn Glu His Tyr Val Lys Ile Ser Leu Ile Gly Ser Gly
195 200 205

Met Arg Asp Met Ser Gly Val Ala Ser Lys Ala Phe Leu Thr Leu Ile
210 215 220

Glu Asn Asn Ile Pro Phe Tyr Gln Thr Thr Thr Ser Glu Ile Ser Ile
225 230 235 240

Ser Tyr Val Ile Asp Asp Phe Asn Gly Gln Gln Ala Val Glu Lys Leu
245 250 255

Tyr Asp Ala Phe Asn Ile
260



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Thr Lys Leu Ala Val Val Gly Ala Thr Gly Leu Val Gly Thr Lys
1 5 10 15

Met Leu Glu Thr Leu Asn Arg Lys Asn Ile Pro Phe Asp Glu Leu Val
20 25 30

Leu Phe Ser Ser Ala Arg Ser Ala Gly Gln Glu Val Glu Phe Gln Gly
35 40 45

Lys Thr Tyr Thr Val Gln Glu Leu Thr Asp Ala Arg Ala Ser Glu His
50 55 60

Phe Asp Tyr Val Leu Met Ser Ala Gly Gly Gly Thr Ser Glu His Phe
65 70 75 80

Ala Pro Leu Phe Glu Lys Ala Gly Ala Ile Val Ile Asp Asn Ser Ser
85 90 95

Gln Trp Arg Met Ala Glu Asp Ile Asp Leu Ile Val Pro Glu Val Asn
100 105 110

Glu Pro Thr Phe Thr Arg Gly Ile Ile Ala Asn Pro Asn Cys Ser Thr
115 120 125

Ile Gln Ser Val Val Pro Leu Lys Val Leu Gln Asp Ala Tyr Gly Leu
130 135 140

Lys Arg Val Ala Tyr Thr Thr Tyr Gln Ala Val Ser Gly Ser Gly Met
145 150 155 160

Lys Gly Lys Lys Asp Leu Ala Glu Gly Val Asn Gly Lys Ala Pro Glu
165 170 175

Ala Tyr Pro His Pro Ile Tyr Asn Asn Val Leu Pro His Ile Asp Val
180 185 190

Phe Leu Glu Asn Gly Tyr Thr Lys Glu Glu Gln Lys Met Ile Asp Glu
195 200 205

Thr Arg Lys Ile Leu Asn Ala Pro Asp Leu Lys Val Thr Ala Thr Cys
210 215 220

Ala Arg Val Pro Val Gln Asp Ser His Ser Val Glu Ile Asp Val Thr
225 230 235 240

Leu Asp Lys Glu Thr Thr Ala Glu Asp Ile Lys Ala Leu Phe Asp Gln
245 250 255

Asp Asp Arg Val Val Leu Val Asp Asn Pro Glu Asn Asn Glu Tyr Pro
260 265 270

Met Ala Ile Asn Ser Thr Asn Lys Asp Glu Val Phe Val Gly Arg Ile
275 280 285

Arg Arg Asp Asp Ser Leu Glu Asn Thr Phe His Val Trp Cys Thr Ser
290 295 300

Asp Asn Leu Leu Lys Gly Ala Ala Leu Asn Ala Val Gln Val Leu Glu
305 310 315 320

Gln Val Met Arg Leu Lys Gly Ala Asn
325

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA (genomic) (p13b26)"

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..924



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG AAC GAA GCC GAC ATG CTG TTC TCT GTC ACT GTT CCC GGA AGC ACA 48
Met Asn Glu Ala Asp Met Leu Phe Ser Val Thr Val Pro Gly Ser Thr
1 5 10 15

GCT AAC CTA GGC CCC GGC TTT GAT TCA GTC GGA ATG GCG CTC AGC AGA 96
Ala Asn Leu Gly Pro Gly Phe Asp Ser Val Gly Met Ala Leu Ser Arg
20 25 30

TAT TTG AAG CTG ACC GTC TTT GAA AGC GAC AAA TGG TCT TTT GAG GCT 144
Tyr Leu Lys Leu Thr Val Phe Glu Ser Asp Lys Trp Ser Phe Glu Ala
35 40 45

GAA ACA GAA ACA GTC GCC GGA ATT CGC GGT ACA GAT AAC CTG ATC TAC 192
Glu Thr Glu Thr Val Ala Gly Ile Arg Gly Thr Asp Asn Leu Ile Tyr
50 55 60

CAA GTG GCT AAA CGG ACC GCA GAT TTG TAC GGA AAA GAA ATG CCT CCT 240
Gln Val Ala Lys Arg Thr Ala Asp Leu Tyr Gly Lys Glu Met Pro Pro
65 70 75 80

GTC CAT GTG AAG GTG TGG AGC GAC ATC CCG CTT GCA CGC GGC CTT GGC 288
Val His Val Lys Val Trp Ser Asp Ile Pro Leu Ala Arg Gly Leu Gly
85 90 95

AGC AGC GCC GCA GCG ATT GTA GCG GCC ATT GAA CTG GCT GAT GAA TTA 336
Ser Ser Ala Ala Ala Ile Val Ala Ala Ile Glu Leu Ala Asp Glu Leu
100 105 110

TGC GGC TTA AAG CTG TCT GAA GCG GAC AAG CTG CAT TTA GCG AGT CTA 384
Cys Gly Leu Lys Leu Ser Glu Ala Asp Lys Leu His Leu Ala Ser Leu
115 120 125

GAA GAA GGA CAC CCG GAC AAT GCT GGC GCT TCT CTC GTC GGC GGA CTT 432
Glu Glu Gly His Pro Asp Asn Ala Gly Ala Ser Leu Val Gly Gly Leu
130 135 140

GTG ATC GGC CTG CAT GAG GAT GAC GAG ACC CAA ATG ATC CGC GTC CCG 480
Val Ile Gly Leu His Glu Asp Asp Glu Thr Gln Met Ile Arg Val Pro
145 150 155 160

AAT GCT GAC ATT GAC GTA GTC GTT GTC ATT CCT TTT TAT GAA GTG CTG 528
Asn Ala Asp Ile Asp Val Val Val Val Ile Pro Phe Tyr Glu Val Leu
165 170 175

ACA AGA GAC GCG AGA GAC GTG CTT CCG AAG GAG TTT CCA TAT GCC GAT 576
Thr Arg Asp Ala Arg Asp Val Leu Pro Lys Glu Phe Pro Tyr Ala Asp
180 185 190

GCC GTA AAA GCA AGT GCT GTC AGC AAT ATC CTC ATT GCT GCG ATC ATG 624
Ala Val Lys Ala Ser Ala Val Ser Asn Ile Leu Ile Ala Ala Ile Met
195 200 205

TCC AAG GAT TGG CCG CTT GTC GGG AAA ATC ATG AAG AAG GAT ATG TTC 672
Ser Lys Asp Trp Pro Leu Val Gly Lys Ile Met Lys Lys Asp Met Phe
210 215 220

CAT CAG CCG TAC CGG GCG ATG CTT GTG CCT GAG CTG TCA AAA GTA GAG 720
His Gln Pro Tyr Arg Ala Met Leu Val Pro Glu Leu Ser Lys Val Glu
225 230 235 240

CAC GTC GCC GAG ATG AAG GGC GCA TAT GGA ACG GCT CTC AGC GGA GCA 768
His Val Ala Glu Met Lys Gly Ala Tyr Gly Thr Ala Leu Ser Gly Ala
245 250 255

GGC CCA ACG ATT CTC GTC ATG ACC GAA AAA GGA AAG GGA GAA GAG CTA 816
Gly Pro Thr Ile Leu Val Met Thr Glu Lys Gly Lys Gly Glu Glu Leu
260 265 270

AAA GAA CAG CTC GCG CTT CAT TTC CCT CAT TGT GAA GTA GAC GCT TTG 864
Lys Glu Gln Leu Ala Leu His Phe Pro His Cys Glu Val Asp Ala Leu
275 280 285

ACC GTT CCG AAA GAG GGA AGT ATA ATA GAG CGA AAT CCT TTA TAT CAA 912
Thr Val Pro Lys Glu Gly Ser Ile Ile Glu Arg Asn Pro Leu Tyr Gln
290 295 300

GTA AAA AGT GTA TAG 927
Val Lys Ser Val
305



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Asn Glu Ala Asp Met Leu Phe Ser Val Thr Val Pro Gly Ser Thr
1 5 10 15

Ala Asn Leu Gly Pro Gly Phe Asp Ser Val Gly Met Ala Leu Ser Arg
20 25 30

Tyr Leu Lys Leu Thr Val Phe Glu Ser Asp Lys Trp Ser Phe Glu Ala
35 40 45

Glu Thr Glu Thr Val Ala Gly Ile Arg Gly Thr Asp Asn Leu Ile Tyr
50 55 60

Gln Val Ala Lys Arg Thr Ala Asp Leu Tyr Gly Lys Glu Met Pro Pro
65 70 75 80

Val His Val Lys Val Trp Ser Asp Ile Pro Leu Ala Arg Gly Leu Gly
85 90 95

Ser Ser Ala Ala Ala Ile Val Ala Ala Ile Glu Leu Ala Asp Glu Leu
100 105 110

Cys Gly Leu Lys Leu Ser Glu Ala Asp Lys Leu His Leu Ala Ser Leu
115 120 125

Glu Glu Gly His Pro Asp Asn Ala Gly Ala Ser Leu Val Gly Gly Leu
130 135 140

Val Ile Gly Leu His Glu Asp Asp Glu Thr Gln Met Ile Arg Val Pro
145 150 155 160

Asn Ala Asp Ile Asp Val Val Val Val Ile Pro Phe Tyr Glu Val Leu
165 170 175

Thr Arg Asp Ala Arg Asp Val Leu Pro Lys Glu Phe Pro Tyr Ala Asp
180 185 190

Ala Val Lys Ala Ser Ala Val Ser Asn Ile Leu Ile Ala Ala Ile Met
195 200 205

Ser Lys Asp Trp Pro Leu Val Gly Lys Ile Met Lys Lys Asp Met Phe
210 215 220

His Gln Pro Tyr Arg Ala Met Leu Val Pro Glu Leu Ser Lys Val Glu
225 230 235 240

His Val Ala Glu Met Lys Gly Ala Tyr Gly Thr Ala Leu Ser Gly Ala
245 250 255

Gly Pro Thr Ile Leu Val Met Thr Glu Lys Gly Lys Gly Glu Glu Leu
260 265 270

Lys Glu Gln Leu Ala Leu His Phe Pro His Cys Glu Val Asp Ala Leu
275 280 285

Thr Val Pro Lys Glu Gly Ser Ile Ile Glu Arg Asn Pro Leu Tyr Gln
290 295 300

Val Lys Ser Val
305

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1320 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p7c18)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1317

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

TTG TTC TTA CAC GGC ACA AGC AGA CAA AAT CAA CAT GGT CAT TTA GAA 48
Leu Phe Leu His Gly Thr Ser Arg Gln Asn Gln His Gly His Leu Glu
1 5 10 15

ATC GGA GGT GTG GAT GCT CTC TAT TTA GCG GAG AAA TAT GGT ACA CCT 96
Ile Gly Gly Val Asp Ala Leu Tyr Leu Ala Glu Lys Tyr Gly Thr Pro
20 25 30

CTT TAC GTA TAT GAT GTG GCT TTA ATA CGT GAG CGT GCT AAA AGC TTT 144
Leu Tyr Val Tyr Asp Val Ala Leu Ile Arg Glu Arg Ala Lys Ser Phe
35 40 45

AAG CAG GCG TTT ATT TCT GCA GGG CTG AAA GCA CAG GTG GCA TAT GCG 192
Lys Gln Ala Phe Ile Ser Ala Gly Leu Lys Ala Gln Val Ala Tyr Ala
50 55 60

^ ll C TCA TCA GTC GCA ATG ATT C ^ G CTC GCT GAG GAA GAG 240
Ser Lys Ala Phe Ser Ser Val Ala Met Ile Gln Leu Ala Glu Glu Glu
65 70 75 80

GGA CTT TCT TTA GAT GTC GTA TCC GGA GGA GAG CTA TAT ACG GCT GTT 288
Gly Leu Ser Leu Asp Val Val Ser Gly Gly Glu Leu Tyr Thr Ala Val
85 90 95

GCA GCA GGC TTT CCG GCA GAA CGC ATC CAC TTT CAT GGA AAC A£T AAO 336
Ala Ala Gly Phe Pro Ala Glu Arg Ile His Phe His Gly Asn Asn Lys
100 105 110

AGC AGG GAA GAA CTG CGG ATG GCG CTT GAG CAC CGC ATC GGC TGC ATT 384
Ser Arg Glu Glu Leu Arg Met Ala Leu Glu His Arg Ile Gly Cys Ile
115 120 125

GTG GTG GAT AAT TTC TAT GAA ATC GCG CTT CTT GAA GAC CTA TGT AAA 432
Val Val Asp Asn Phe Tyr Glu Ile Ala Leu Leu Glu Asp Leu Cys Lys
130 135 140

GAA ACG GGT CAC TCC ATC GAT GTT CTT CTT CGG ATC ACG CCC GGA GTA 480
Glu Thr Gly His Ser Ile Asp Val Leu Leu Arg Ile Thr Pro Gly Val
145 150 155 160

GAA GCG CAT ACG CAT GAC TAC ATT ACA ACG GGC CAG GAA GAT TCA AAG 528
Glu Ala His Thr His Asp Tyr Ile Thr Thr Gly Gln Glu Asp Ser Lys
165 170 175

TTT GGT TTC GAT CTT CAT AAC GGA CAA ACT GAA CGG GCC ATT GAA CAA 576
Phe Gly Phe Asp Leu His Asn Gly Gln Thr Glu Arg Ala Ile Glu Gln
180 185 190

GTA TTA CAA TCG GAA CAC ATT CAG CTG CTG GGT GTC CAT TGC CAT ATC 624
Val Leu Gln Ser Glu His Ile Gln Leu Leu Gly Val His Cys His Ile
195 200 205

GGC TCG CAA ATC TTT GAT ACG GCC GGT TTT GTG TTA GCA GCG GAA AAA 672
Gly Ser Gln Ile Phe Asp Thr Ala Gly Phe Val Leu Ala Ala Glu Lys
210 215 220

ATC TTC AAA AAA CTA GAC GAA TGG AGA GAT TCA TAT TCA TTT GTA TCC 720
Ile Phe Lys Lys Leu Asp Glu Trp Arg Asp Ser Tyr Ser Phe Val Ser
225 230 235 240

AAG GTG CTG AAT CTT GGA GGA GGT TTC GGC ATT CGT TAT ACG GAA GAT 768
Lys Val Leu Asn Leu Gly Gly Gly Phe Gly Ile Arg Tyr Thr Glu Asp
245 250 255

GAT GAA CCG CTT CAT GCC ACT GAA TAC GTT GAA AAA ATT ATC GAA GCT 816
Asp Glu Pro Leu His Ala Thr Glu Tyr Val Glu Lys Ile Ile Glu Ala
260 265 270

GTG AAA GAA AAT GCT TCC CGT TAC GGT TTT GAC ATT CCG GAA ATT TGG 864
Val Lys Glu Asn Ala Ser Arg Tyr Gly Phe Asp Ile Pro Glu Ile Trp
275 280 285

ATC GAA CCG GGC CGT TCT CTC GTG GGA GAC GCA GGC ACA ACT CTT TAT 912
Ile Glu Pro Gly Arg Ser Leu Val Gly Asp Ala Gly Thr Thr Leu Tyr
290 295 300

ACG GTT GGC TCT CAA AAA GAA GTG CCG GGT GTC CGC CAA TAT GTG GCT 960
Thr Val Gly Ser Gln Lys Glu Val Pro Gly Val Arg Gln Tyr Val Ala
305 310 315 320

GTA GAC GGA GGC ATG AAC GAC AAT ATT CGT CCT GCG CTT TAC CAA GCT 1008
Val Asp Gly Gly Met Asn Asp Asn Ile Arg Pro Ala Leu Tyr Gln Ala
325 330 335

AAA TAT GAA GCT GCG GCA GCC AAC AGG ATC GGA GAA GCG CAT GAC AAA 1056
Lys Tyr Glu Ala Ala Ala Ala Asn Arg Ile Gly Glu Ala His Asp Lys
340 345 350

ACG GTA TCA ATT GCC GGA AAG TGC TGT GAA AGC GGA GAT ATG CTG ATT 1104
Thr Val Ser Ile Ala Gly Lys Cys Cys Glu Ser Gly Asp Met Leu Ile
355 360 365

TGG GAT ATT GAC CTG CCG GAA GTA AAA GAA GGC GAT CTT CTT GCC GTT 1152
Trp Asp Ile Asp Leu Pro Glu Val Lys Glu Gly Asp Leu Leu Ala Val
370 375 380

TTT TGT ACA GGC GCT TAT GGA TAC AGC ATG GCC AAC AAT TAT AAC CGT 1200
Phe Cys Thr Gly Ala Tyr Gly Tyr Ser Met Ala Asn Asn Tyr Asn Arg
385 390 395 400

ATT CCG AGA CCC GCC GTT GTA TTT GTC GAA AAC GGT GAG GCT CAT TTA 1248
Ile Pro Arg Pro Ala Val Val Phe Val Glu Asn Gly Glu Ala His Leu
405 410 415

GTC GTG AAG CGA GAA ACA TAC GAA GAT ATT GTA AAA CTT GAT CTG CCA 1296
Val Val Lys Arg Glu Thr Tyr Glu Asp Ile Val Lys Leu Asp Leu Pro
420 425 430

TTT AAA ACG GGT GTA AAG CAA TAA 1320
Phe Lys Thr Gly Val Lys Gln
435



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Leu Phe Leu His Gly Thr Ser Arg Gln Asn Gln His Gly His Leu Glu
1 5 10 15

Ile Gly Gly Val Asp Ala Leu Tyr Leu Ala Glu Lys Tyr Gly Thr Pro
20 25 30

Leu Tyr Val Tyr Asp Val Ala Leu Ile Arg Glu Arg Ala Lys Ser Phe
35 40 45

Lys Gln Ala Phe Ile Ser Ala Gly Leu Lys Ala Gln Val Ala Tyr Ala
50 55 60

Ser Lys Ala Phe Ser Ser Val Ala Met Ile Gln Leu Ala Glu Glu Glu
65 70 75 80

Gly Leu Ser Leu Asp Val Val Ser Gly Gly Glu Leu Tyr Thr Ala Val
85 90 95

Ala Ala Gly Phe Pro Ala Glu Arg Ile His Phe His Gly Asn Asn Lys
100 105 110

Ser Arg Glu Glu Leu Arg Met Ala Leu Glu His Arg Ile Gly Cys Ile
115 120 125

Val Val Asp Asn Phe Tyr Glu Ile Ala Leu Leu Glu Asp Leu Cys Lys
130 135 140

Glu Thr Gly His Ser Ile Asp Val Leu Leu Arg Ile Thr Pro Gly Val
145 150 155 160

Glu Ala His Thr His Asp Tyr Ile Thr Thr Gly Gln Glu Asp Ser Lys
165 170 175

Phe Gly Phe Asp Leu His Asn Gly Gln Thr Glu Arg Ala Ile Glu Gln
180 185 190

Val Leu Gln Ser Glu His Ile Gln Leu Leu Gly Val His Cys His Ile
195 200 205

Gly Ser Gln Ile Phe Asp Thr Ala Gly Phe Val Leu Ala Ala Glu Lys
210 215 220

Ile Phe Lys Lys Leu Asp Glu Trp Arg Asp Ser Tyr Ser Phe Val Ser
225 230 235 240

Lys Val Leu Asn Leu Gly Gly Gly Phe Gly Ile Arg Tyr Thr Glu Asp
245 250 255

Asp Glu Pro Leu His Ala Thr Glu Tyr Val Glu Lys Ile Ile Glu Ala
260 265 270

Val Lys Glu Asn Ala Ser Arg Tyr Gly Phe Asp Ile Pro Glu Ile Trp
275 280 285

Ile Glu Pro Gly Arg Ser Leu Val Gly Asp Ala Gly Thr Thr Leu Tyr
290 295 300

Thr Val Gly Ser Gln Lys Glu Val Pro Gly Val Arg Gln Tyr Val Ala
305 310 315 320

Val Asp Gly Gly Met Asn Asp Asn Ile Arg Pro Ala Leu Tyr Gln Ala
325 330 335

Lys Tyr Glu Ala Ala Ala Ala Asn Arg Ile Gly Glu Ala His Asp Lys
340 345 350

Thr Val Ser Ile Ala Gly Lys Cys Cys Glu Ser Gly Asp Met Leu Ile
355 360 365

Trp Asp Ile Asp Leu Pro Glu Val Lys Glu Gly Asp Leu Leu Ala Val
370 375 380

Phe Cys Thr Gly Ala Tyr Gly Tyr Ser Met Ala Asn Asn Tyr Asn Arg
385 390 395 400

Ile Pro Arg Pro Ala Val Val Phe Val Glu Asn Gly Glu Ala His Leu
405 410 415

Val Val Lys Arg Glu Thr Tyr Glu Asp Ile Val Lys Leu Asp Leu Pro
420 425 430

Phe Lys Thr Gly Val Lys Gln
435



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p15c31)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2187 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATG TCT AAA TTT ATC GAA CCA AGC GTT GAA GAA ATT AAA CTT GAA AAA 48
Met Ser Lys Phe Ile Glu Pro Ser Val Glu Glu Ile Lys Leu Glu Lys
1 5 10 15

GTA TAT CAA GAT ATG GGA TTA AGT GAT CAA GAA TAT GAA AAA GTT TGC 96
Val Tyr Gln Asp Met Gly Leu Ser Asp Gln Glu Tyr Glu Lys Val Cys
20 25 30

GAT ATT TTA GGC AGA CAA CCT AAC TTT ACA GAA ACA GGT ATC TTT TCT 144
Asp Ile Leu Gly Arg Gln Pro Asn Phe Thr Glu Thr Gly Ile Phe Ser
35 40 45

GTT ATG TGG AGT GAA CAT TGC TCT TAT AAA CAT TCT AAA CCG TTT TTA 192
Val Met Trp Ser Glu His Cys Ser Tyr Lys His Ser Lys Pro Phe Leu
50 55 60

AAG CAA TTT CCT ACG TCA GGT GAC CAT GTG CTT ATG GGG CCT GGT GAA 240
Lys Gln Phe Pro Thr Ser Gly Asp His Val Leu Met Gly Pro Gly Glu
65 70 75 80

GGT GCA GGG GTA GTC GAT ATA GGT GAT AAT CAA GCC GTA GTA TTT AAA 288
Gly Ala Gly Val Val Asp Ile Gly Asp Asn Gln Ala Val Val Phe Lys
85 90 95

GTA GAG TCT CAC AAT CAT CCA TCA GCA ATT GAA CCA TAT CAA GGG GCT 336
Val Glu Ser His Asn His Pro Ser Ala Ile Glu Pro Tyr Gln Gly Ala
100 105 110

GCT ACA GGC GTT GGT GGA ATC ATT CGT GAC ATT GTC TCT ATT GGG GCT 384
Ala Thr Gly Val Gly Gly Ile Ile Arg Asp Ile Val Ser Ile Gly Ala
115 120 125

AGA CCT ATT AAT TTG TTA AAC AGT CTT AGA TTT GGA GAA TTA GAT AAT 432
Arg Pro Ile Asn Leu Leu Asn Ser Leu Arg Phe Gly Glu Leu Asp Asn
130 135 140

AAA CAA AAC CAA AGA TTA CTT AAA GGT GTT GTA AAG GGT ATC GGA GGT 480
Lys Gln Asn Gln Arg Leu Leu Lys Gly Val Val Lys Gly Ile Gly Gly
145 150 155 160

TAT GGT AAC TGC ATT GGT ATT CCA ACA ACT GCT GGT GAA ATC GAA TTT 528
Tyr Gly Asn Cys Ile Gly Ile Pro Thr Thr Ala Gly Glu Ile Glu Phe
165 170 175

GAT GAA CGT TAT GAT GGC AAT CCA CTT GTT AAT GCA ATG TGT GTT GGT 576
Asp Glu Arg Tyr Asp Gly Asn Pro Leu Val Asn Ala Met Cys Val Gly
180 185 190

GTT ATC AAT CAC GAC ATG ATT CAA AAA GGC ACA GCA AAA GGT GTA GGT 624
Val Ile Asn His Asp Met Ile Gln Lys Gly Thr Ala Lys Gly Val Gly
195 200 205

AAT TCG GTC ATT TAT GTT GGT TTG AAA ACT GGT CGA GAT GGT ATT CAT 672
Asn Ser Val Ile Tyr Val Gly Leu Lys Thr Gly Arg Asp Gly Ile His
210 215 220

GGT GCT ACT TTT GCA TCT GAA GAA TTG ACG GAA GAA AGC GAA AGT AAA 720
Gly Ala Thr Phe Ala Ser Glu Glu Leu Thr Glu Glu Ser Glu Ser Lys
225 230 235 240

CGA CCT TCT GTA CAA ATC GGT GAT CCA TTT GTA GGT AAA AAA TTA ATG 768
Arg Pro Ser Val Gln Ile Gly Asp Pro Phe Val Gly Lys Lys Leu Met
245 250 255

GAA GCA ACA CTT GAA GCA ATT ACA TTT GAT GAA TTA GTT GGT ATT CAA 816
Glu Ala Thr Leu Glu Ala Ile Thr Phe Asp Glu Leu Val Gly Ile Gln
260 265 270

GAT ATG GGT GCT GCT GGT TTA ACA TCT TCA TCG TCT GAA ATG GCG GCA 864
Asp Met Gly Ala Ala Gly Leu Thr Ser Ser Ser Ser Glu Met Ala Ala
275 280 285

AAA GGT GGT AGT GGG TTA CAT TTG AGA TTA GAA CAA GTG CCA ACA CGT 912
Lys Gly Gly Ser Gly Leu His Leu Arg Leu Glu Gln Val Pro Thr Arg
290 295 300

GAG CCA GGT ATT TCT CCT TAT GAA ATG ATG CTT TCA GAA ACT CAA GAA 960
Glu Pro Gly Ile Ser Pro Tyr Glu Met Met Leu Ser Glu Thr Gln Glu
305 310 315 320



CGT ATG TTA CTA GTT GTT GAA AAA GGT ACT GAA CAA AAA TTC TTA GAT 1008
Arg Met Leu Leu Val Val Glu Lys Gly Thr Glu Gln Lys Phe Leu Asp
325 330 335

TTA TTT GAT AAG CAC GAA TTG GAT AGT GCT GTT ATA GGT GAA GTT ACA 1056
Leu Phe Asp Lys His Glu Leu Asp Ser Ala Val Ile Gly Glu Val Thr
340 345 350

GAT ACA AAT CGT TTT GTT TTA ACA TAT GAT GAC GAA GTT TAT GCT GAC 1104
Asp Thr Asn Arg Phe Val Leu Thr Tyr Asp Asp Glu Val Tyr Ala Asp
355 360 365

ATT CCA GTT GAA CCA CTA GCT GAT GAA GCA CCT GTA TAT ATT TTA GAA 1152
Ile Pro Val Glu Pro Leu Ala Asp Glu Ala Pro Val Tyr Ile Leu Glu
370 375 380

GGA GAA GAA AAA GAT TAT AAT ACT TCT AAA AAT GAT TAT ACA CAC ATC 1200
Gly Glu Glu Lys Asp Tyr Asn Thr Ser Lys Asn Asp Tyr Thr His Ile
385 390 395 400

GAT GTT AAA GAT ACT TTC TTT AAA TTA CTT AAG CAT CCG ACT ATA GCA 1248
Asp Val Lys Asp Thr Phe Phe Lys Leu Leu Lys His Pro Thr Ile Ala
405 410 415

TCT AAA CAC TAT TTA TAT GAT CAA TAC GAC CAA CAA GTT GGT GCC AAT 1296
Ser Lys His Tyr Leu Tyr Asp Gln Tyr Asp Gln Gln Val Gly Ala Asn
420 425 430

ACG ATA ATT AAG CCA GGA CTT CAA GCA TCG GTA GTA CGT GTG GAA GGC 1344
Thr Ile Ile Lys Pro Gly Leu Gln Ala Ser Val Val Arg Val Glu Gly
435 440 445

ACA AAT AAG GCA ATT GCT TCA ACA ATT GAT GGT GAA GCG CGT TAT GTA 1392
Thr Asn Lys Ala Ile Ala Ser Thr Ile Asp Gly Glu Ala Arg Tyr Val
450 455 460

TAT AAC AAT CCA TAT GAA GGT GGA AAG ATG GTA GTA GCT GAA GCT TAT 1440
Tyr Asn Asn Pro Tyr Glu Gly Gly Lys Met Val Val Ala Glu Ala Tyr
465 470 475 480

CGA AAT TTA ATT GCC GTG GGT GCA ACA CCA TTA GCA ATG ACA GAT TGT 1488
Arg Asn Leu Ile Ala Val Gly Ala Thr Pro Leu Ala Met Thr Asp Cys
485 490 495

TTA AAT TAT GGT TCT CCT GAA AAG AAA GAA ATC TAT CAA CAG TTG ATA 1536
Leu Asn Tyr Gly Ser Pro Glu Lys Lys Glu Ile Tyr Gln Gln Leu Ile
500 505 510

GAT TCA ACG AAA GGT ATG GCA GAA GCA TGC GAC ATT CTT AAG ACA CCA 1584
Asp Ser Thr Lys Gly Met Ala Glu Ala Cys Asp Ile Leu Lys Thr Pro
515 520 525

GTA GTT TCT GGT AAT GTA TCT TTA TAT AAC GAA ACG AAA GGT ACT TCT 1632
Val Val Ser Gly Asn Val Ser Leu Tyr Asn Glu Thr Lys Gly Thr Ser
530 535 540



ATT TTC CCA ACA CCA GTT GTT GGA ATG GTA GGT TTG ATT GAA AAT GTA 1680
Ile Phe Pro Thr Pro Val Val Gly Met Val Gly Leu Ile Glu Asn Val
545 550 555 560

AAT TAT TTA AAT GAT TTT GAA CCT CAA GTT GGA GAT AAA TTA TAT TTA 1728
Asn Tyr Leu Asn Asp Phe Glu Pro Gln Val Gly Asp Lys Leu Tyr Leu
565 570 575

ATC GGT GAT ACT AAG GAC GAC TTT GGT GGT AGT CAA CTT GAA AAG TTA 1776
Ile Gly Asp Thr Lys Asp Asp Phe Gly Gly Ser Gln Leu Glu Lys Leu
580 585 590

ATT TAT GGC AAA GTT AAT CAT GAA TTT GAG TCA TTA GAT TTG AGT TCA 1824
Ile Tyr Gly Lys Val Asn His Glu Phe Glu Ser Leu Asp Leu Ser Ser
595 600 605

GAA GTT GAA AAA GGT GAA TCA ATC AAG ACC GCT ATT CGT GAA GGA CTA 1872
Glu Val Glu Lys Gly Glu Ser Ile Lys Thr Ala Ile Arg Glu Gly Leu
610 615 620

TTA TCA CAT GTT CAA ACA GTT GGT AAA GGT GGC TTA CTG ATT ACC TTA 1920
Leu Ser His Val Gln Thr Val Gly Lys Gly Gly Leu Leu Ile Thr Leu
625 630 635 640

GCT AAA CTA AGT GCG CAT TAC GGT TTA GGA TTA AAA TCT TCA ATA GAT 1968
Ala Lys Leu Ser Ala His Tyr Gly Leu Gly Leu Lys Ser Ser Ile Asp
645 650 655

ATA ACA AAT GCA CAA TTG TTT AGT GAG ACG CAA GGC CGA TAT GTT GTT 2016
Ile Thr Asn Ala Gln Leu Phe Ser Glu Thr Gln Gly Arg Tyr Val Val
660 665 670

TCT GTT AAA TCA GGT AAA ACT TTA AAT ATT GAT AAT GCA ATA GAA ATT 2064
Ser Val Lys Ser Gly Lys Thr Leu Asn Ile Asp Asn Ala Ile Glu Ile
675 680 685

GGA CTT TTA ACA GAT AGT GAT AAT TyC AAG GTA ACA ACA CCA TAT ACA 2112
Gly Leu Leu Thr Asp Ser Asp Asn Xaa Lys Val Thr Thr Pro Tyr Thr
690 695 700

GAG ATT AGT GAA AAT GTT TCA GAT ATT AAA CAA ATA TGG GAA GGG GCA 2160
Glu Ile Ser Glu Asn Val Ser Asp Ile Lys Gln Ile Trp Glu Gly Ala
705 710 715 720

ATT GCT CAA TGT TTA ACT ACT CAG GAT TAA 2190
Ile Ala Gln Cys Leu Thr Thr Gln Asp
725

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Ser Lys Phe Ile Glu Pro Ser Val Glu Glu Ile Lys Leu Glu Lys
1 5 10 15

Val Tyr Gln Asp Met Gly Leu Ser Asp Gln Glu Tyr Glu Lys Val Cys
20 25 30

Asp Ile Leu Gly Arg Gln Pro Asn Phe Thr Glu Thr Gly Ile Phe Ser
35 40 45

Val Met Trp Ser Glu His Cys Ser Tyr Lys His Ser Lys Pro Phe Leu
50 55 60

Lys Gln Phe Pro Thr Ser Gly Asp His Val Leu Met Gly Pro Gly Glu
65 70 75 80

Gly Ala Gly Val Val Asp Ile Gly Asp Asn Gln Ala Val Val Phe Lys
85 90 95

Val Glu Ser His Asn His Pro Ser Ala Ile Glu Pro Tyr Gln Gly Ala
100 105 110

Ala Thr Gly Val Gly Gly Ile Ile Arg Asp Ile Val Ser Ile Gly Ala
115 120 125

Arg Pro Ile Asn Leu Leu Asn Ser Leu Arg Phe Gly Glu Leu Asp Asn
130 135 140

Lys Gln Asn Gln Arg Leu Leu Lys Gly Val Val Lys Gly Ile Gly Gly
145 150 155 160

Tyr Gly Asn Cys Ile Gly Ile Pro Thr Thr Ala Gly Glu Ile Glu Phe
165 170 175

Asp Glu Arg Tyr Asp Gly Asn Pro Leu Val Asn Ala Met Cys Val Gly
180 185 190

Val Ile Asn His Asp Met Ile Gln Lys Gly Thr Ala Lys Gly Val Gly
195 200 205

Asn Ser Val Ile Tyr Val Gly Leu Lys Thr Gly Arg Asp Gly Ile His
210 215 220

Gly Ala Thr Phe Ala Ser Glu Glu Leu Thr Glu Glu Ser Glu Ser Lys
225 230 235 240

Arg Pro Ser Val Gln Ile Gly Asp Pro Phe Val Gly Lys Lys Leu Met
245 250 255

Glu Ala Thr Leu Glu Ala Ile Thr Phe Asp Glu Leu Val Gly Ile Gln
260 265 270

Asp Met Gly Ala Ala Gly Leu Thr Ser Ser Ser Ser Glu Met Ala Ala
275 280 285

Lys Gly Gly Ser Gly Leu His Leu Arg Leu Glu Gln Val Pro Thr Arg
290 295 300

Glu Pro Gly Ile Ser Pro Tyr Glu Met Met Leu Ser Glu Thr Gln Glu
305 310 315 320

Arg Met Leu Leu Val Val Glu Lys Gly Thr Glu Gln Lys Phe Leu Asp
325 330 335

Leu Phe Asp Lys His Glu Leu Asp Ser Ala Val Ile Gly Glu Val Thr
340 345 350

Asp Thr Asn Arg Phe Val Leu Thr Tyr Asp Asp Glu Val Tyr Ala Asp
355 360 365

Ile Pro Val Glu Pro Leu Ala Asp Glu Ala Pro Val Tyr Ile Leu Glu
370 375 380

Gly Glu Glu Lys Asp Tyr Asn Thr Ser Lys Asn Asp Tyr Thr His Ile
385 390 395 400

Asp Val Lys Asp Thr Phe Phe Lys Leu Leu Lys His Pro Thr Ile Ala
405 410 415

Ser Lys His Tyr Leu Tyr Asp Gln Tyr Asp Gln Gln Val Gly Ala Asn
420 425 430

Thr Ile Ile Lys Pro Gly Leu Gln Ala Ser Val Val Arg Val Glu Gly
435 440 445

Thr Asn Lys Ala Ile Ala Ser Thr Ile Asp Gly Glu Ala Arg Tyr Val
450 455 460

Tyr Asn Asn Pro Tyr Glu Gly Gly Lys Met Val Val Ala Glu Ala Tyr
465 470 475 480

Arg Asn Leu Ile Ala Val Gly Ala Thr Pro Leu Ala Met Thr Asp Cys
485 490 495

Leu Asn Tyr Gly Ser Pro Glu Lys Lys Glu Ile Tyr Gln Gln Leu Ile
500 505 510

Asp Ser Thr Lys Gly Met Ala Glu Ala Cys Asp Ile Leu Lys Thr Pro
515 520 525

Val Val Ser Gly Asn Val Ser Leu Tyr Asn Glu Thr Lys Gly Thr Ser
530 535 540

Ile Phe Pro Thr Pro Val Val Gly Met Val Gly Leu Ile Glu Asn Val
545 550 555 560

Asn Tyr Leu Asn Asp Phe Glu Pro Gln Val Gly Asp Lys Leu Tyr Leu
565 570 575

Ile Gly Asp Thr Lys Asp Asp Phe Gly Gly Ser Gln Leu Glu Lys Leu
580 585 590

Ile Tyr Gly Lys Val Asn His Glu Phe Glu Ser Leu Asp Leu Ser Ser
595 600 605

Glu Val Glu Lys Gly Glu Ser Ile Lys Thr Ala Ile Arg Glu Gly Leu
610 615 620

Leu Ser His Val Gln Thr Val Gly Lys Gly Gly Leu Leu Ile Thr Leu
625 630 635 640

Ala Lys Leu Ser Ala His Tyr Gly Leu Gly Leu Lys Ser Ser Ile Asp
645 650 655

Ile Thr Asn Ala Gln Leu Phe Ser Glu Thr Gln Gly Arg Tyr Val Val
660 665 670

Ser Val Lys Ser Gly Lys Thr Leu Asn Ile Asp Asn Ala Ile Glu Ile
675 680 685

Gly Leu Leu Thr Asp Ser Asp Asn Xaa Lys Val Thr Thr Pro Tyr Thr
690 695 700

Glu Ile Ser Glu Asn Val Ser Asp Ile Lys Gln Ile Trp Glu Gly Ala
705 710 715 720

Ile Ala Gln Cys Leu Thr Thr Gln Asp
725

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b18/p6b18)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..726



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



ATG ACT AAA TTA TTT ATA CCT TAT ATT ATG GGC AAT AAA GAT TTG ATT 48
Met Thr Lys Leu Phe Ile Pro Tyr Ile Met Gly Asn Lys Asp Leu Ile
1 5 10 15

GAA AAT GCA ACA TTG TTG AGT GAA AAT GGT GCA GAT ATA ATT GAA ATT 96
Glu Asn Ala Thr Leu Leu Ser Glu Asn Gly Ala Asp Ile Ile Glu Ile
20 25 30

GGA GTA CCT TTC TCT GAT CCG GTT GCT GAT GGT CCA GTT ATC ATG GAA 144
Gly Val Pro Phe Ser Asp Pro Val Ala Asp Gly Pro Val Ile Met Glu
35 40 45

GCA GGT CAA CAA GCG ATT AAA CAA GGC ATC ACG ATA GAT TAT ATT TTC 192
Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile Thr Ile Asp Tyr Ile Phe
50 55 60

AAT CAA TTA GAA AAA CAT GGT GAT CAA ATT AAG TGT AAC TAT GTA TTA 240
Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys Cys Asn Tyr Val Leu
65 70 75 80

ATG ACG TAT TAT AAT ATT ATT TGT CAT TAT GGA GAA CAA GCG TTT TTT 288
Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly Glu Gln Ala Phe Phe
85 90 95

GAA AAA TGT CGA GAT ACT GGT GTC TAC GGC TTA ATT ATT CCT GAT TTA 336
Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu Ile Ile Pro Asp Leu
100 105 110

CCA TAT GAA TTA TCG CAG CGT TTA AAA CAA CAA TTT AGT CAC TAT GGC 384
Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln Gln Phe Ser His Tyr Gly
115 120 125

GTC AAA ATC ATA TCG TTA GTT GCG ATG ACT ACT GAT GAC AAA CGT ATA 432
Val Lys Ile Ile Ser Leu Val Ala Met Thr Thr Asp Asp Lys Arg Ile
130 135 140

AAA GAT ATC GTA TCC CAT GCG GAA GGC TTT ATT TAT ACT GTG ACG ATG 480
Lys Asp Ile Val Ser His Ala Glu Gly Phe Ile Tyr Thr Val Thr Met
145 150 155 160

AAT GCG ACA ACA GGG CAA AAC GGT GCG TTT CAT CCA GAA TTA AAA CGA 528
Asn Ala Thr Thr Gly Gln Asn Gly Ala Phe His Pro Glu Leu Lys Arg
165 170 175

AAA ATT GAG TCA ATT AAA GCG ATA GCC AAT GTG CCA GTT GTC GCA GGA 576
Lys Ile Glu Ser Ile Lys Ala Ile Ala Asn Val Pro Val Val Ala Gly
180 185 190

TTT GGT ATA AGA ACA CCA CAA CAT GTT GCA GAT ATA AAA GAG GTT GCA 624
Phe Gly Ile Arg Thr Pro Gln His Val Ala Asp Ile Lys Glu Val Ala
195 200 205

GAT GGC ATT GTC ATT GGT AGC GAA ATC GTT AAG CGA TTT AAA TCT AAC 672
Asp Gly Ile Val Ile Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn
210 215 220

ACG CGT GAG GAA ATC ATT AAA TAT TTA CAA TCT ATC CAA CAA ACA TTG 720
Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu
225 230 235 240

AAT AAT TAA 729
Asn Asn



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Thr Lys Leu Phe Ile Pro Tyr Ile Met Gly Asn Lys Asp Leu Ile
1 5 10 15

Glu Asn Ala Thr Leu Leu Ser Glu Asn Gly Ala Asp Ile Ile Glu Ile
20 25 30

Gly Val Pro Phe Ser Asp Pro Val Ala Asp Gly Pro Val Ile Met Glu
35 40 45

Ala Gly Gln Gln Ala Ile Lys Gln Gly Ile Thr Ile Asp Tyr Ile Phe
50 55 60

Asn Gln Leu Glu Lys His Gly Asp Gln Ile Lys Cys Asn Tyr Val Leu
65 70 75 80

Met Thr Tyr Tyr Asn Ile Ile Cys His Tyr Gly Glu Gln Ala Phe Phe
85 90 95

Glu Lys Cys Arg Asp Thr Gly Val Tyr Gly Leu Ile Ile Pro Asp Leu
100 105 110

Pro Tyr Glu Leu Ser Gln Arg Leu Lys Gln Gln Phe Ser His Tyr Gly
115 120 125

Val Lys Ile Ile Ser Leu Val Ala Met Thr Thr Asp Asp Lys Arg Ile
130 135 140

Lys Asp Ile Val Ser His Ala Glu Gly Phe Ile Tyr Thr Val Thr Met
145 150 155 160

Asn Ala Thr Thr Gly Gln Asn Gly Ala Phe His Pro Glu Leu Lys Arg
165 170 175

Lys Ile Glu Ser Ile Lys Ala Ile Ala Asn Val Pro Val Val Ala Gly
180 185 190

Phe Gly Ile Arg Thr Pro Gln His Val Ala Asp Ile Lys Glu Val Ala
195 200 205

Asp Gly Ile Val Ile Gly Ser Glu Ile Val Lys Arg Phe Lys Ser Asn
210 215 220

Thr Arg Glu Glu Ile Ile Lys Tyr Leu Gln Ser Ile Gln Gln Thr Leu
225 230 235 240

Asn Asn



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1215 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b66)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1212



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

ATG AAT AAA CAA ATA CAA ACA GAA GCA GAT GAA TTA GGT TTC TTT GGT 48
Met Asn Lys Gln Ile Gln Thr Glu Ala Asp Glu Leu Gly Phe Phe Gly
1 5 10 15

GAA TAT GGA GGG CAA TAT GTT CCA GAA ACA TTA ATG CCA GCA ATT ATT 96
Glu Tyr Gly Gly Gln Tyr Val Pro Glu Thr Leu Met Pro Ala Ile Ile
20 25 30

GAG TTG AAA AAA GCT TAT AAA GAG GCA AAA GCA GAC CCA GAG TTT CAA 144
Glu Leu Lys Lys Ala Tyr Lys Glu Ala Lys Ala Asp Pro Glu Phe Gln
35 40 45

AGA GAA CTG GAA TAC TAT TTA TCA GAG TAT GTA GGA CGC GCG ACA CCA 192
Arg Glu Leu Glu Tyr Tyr Leu Ser Glu Tyr Val Gly Arg Ala Thr Pro
50 55 60

CTT ACA TAT GCT GCA TCA TAT ACT GAA AGC CTA GGT GGC GCT AAA ATA 240
Leu Thr Tyr Ala Ala Ser Tyr Thr Glu Ser Leu Gly Gly Ala Lys Ile
65 70 75 80

TAT TTG AAA CGA GAG GAT CTA AAT CAT ACA GGC GCC CAT AAA ATT AAT 288
Tyr Leu Lys Arg Glu Asp Leu Asn His Thr Gly Ala His Lys Ile Asn
85 90 95

AAT GCG TTA GGT CAA GCG TTG CTT GCT AAA AGA ATG GGC AAG AAG AAG 336
Asn Ala Leu Gly Gln Ala Leu Leu Ala Lys Arg Met Gly Lys Lys Lys
100 105 110



CTT GTT GCT GAA ACT GGT GCG GGT CAA CAT GGT GTA GCT AGT GCT ACG 384
Leu Val Ala Glu Thr Gly Ala Gly Gln His Gly Val Ala Ser Ala Thr
115 120 125

GTT GCT GCA TTA TTT GAT ATG GAA CTT GTT GTC TTT ATG GGA AGT GAA 432
Val Ala Ala Leu Phe Asp Met Glu Leu Val Val Phe Met Gly Ser Glu
130 135 140

GAT ATT AAA AGA CAA CAA CTT AAT GTA TTT AGA ATG GAA TTA CTT GGT 480
Asp Ile Lys Arg Gln Gln Leu Asn Val Phe Arg Met Glu Leu Leu Gly
145 150 155 160

GCA AAG GTT GTG GCA GTT GAA GAT GGT CAA GGG ACT TTA TCG GAT GCA 528
Ala Lys Val Val Ala Val Glu Asp Gly Gln Gly Thr Leu Ser Asp Ala
165 170 175

GTT AAT AAA GCA TTG CAA TAT TGG GTA AGT CAT GTA GAT GAT ACA CAT 576
Val Asn Lys Ala Leu Gln Tyr Trp Val Ser His Val Asp Asp Thr His
180 185 190

TAT TTA TTA GGT TCT GCA TTA GGT CCA GAC CCG TTC CCA ACG ATT GTT 624
Tyr Leu Leu Gly Ser Ala Leu Gly Pro Asp Pro Phe Pro Thr Ile Val
195 200 205

AGA GAT TTT CAG AGT GTG ATT GGT AAA GAA ATA AAA TCA CAG ATA TTG 672
Arg Asp Phe Gln Ser Val Ile Gly Lys Glu Ile Lys Ser Gln Ile Leu
210 215 220

AAG AAA GAA GGT CGA CTT CCG GAT GCA ATT GTA GCA TGT ATC GGT GGT 720
Lys Lys Glu Gly Arg Leu Pro Asp Ala Ile Val Ala Cys Ile Gly Gly
225 230 235 240

GGC TCA AAT GCA ATC GGT ACA TTT TAT CCA TTT ATT AAA GAT GAT GTT 768
Gly Ser Asn Ala Ile Gly Thr Phe Tyr Pro Phe Ile Lys Asp Asp Val
245 250 255

GCA TTA TAC GGT GTT GAA GCC GCA GGT CAA GGC GAT GAT ACT GAT AAA 816
Ala Leu Tyr Gly Val Glu Ala Ala Gly Gln Gly Asp Asp Thr Asp Lys
260 265 270

CAT GCA CTT GCA ATT GGC AAA GGA TCA CCT GGC GTA TTA CAT GGT ACT 864
His Ala Leu Ala Ile Gly Lys Gly Ser Pro Gly Val Leu His Gly Thr
275 280 285

AAA ATG TAT TTA ATT CAA GAT GAA GAT GGG CAA GTG CAA CTA GCA CAT 912
Lys Met Tyr Leu Ile Gln Asp Glu Asp Gly Gln Val Gln Leu Ala His
290 295 300

TCT ATT TCA GCA GGA CTT GAT TAT CCT GGT ATT GGA CCA GAA CAT TCT 960
Ser Ile Ser Ala Gly Leu Asp Tyr Pro Gly Ile Gly Pro Glu His Ser
305 310 315 320

TAT TAC CAC GAC ATT GGT AGA GTA ACT TTT GAA AAT GCT AGT GAT ACA 1008
Tyr Tyr His Asp Ile Gly Arg Val Thr Phe Glu Asn Ala Ser Asp Thr
325 330 335

CAA GCA ATG AAT GCT TTA ATC AAC TTT ACA AAA CAT GAA GGT ATT ATA 1056
Gln Ala Met Asn Ala Leu Ile Asn Phe Thr Lys His Glu Gly Ile Ile
340 345 350

CCT GCA ATT GAA AGT GCA CAT GCA CTG AGT TAT GTT GAA AGA CTA GCG 1104
Pro Ala Ile Glu Ser Ala His Ala Leu Ser Tyr Val Glu Arg Leu Ala
355 360 365

CCT ACG ATG TCG AAA GAA GAT ATT ATT GTA GTA ACT ATT TCT GGA CGT 1152
Pro Thr Met Ser Lys Glu Asp Ile Ile Val Val Thr Ile Ser Gly Arg
370 375 380

GGC GAT AAA GAT ATG GAA ACA ATT AGA CAA TAT ATG GTA GAG CGA GGT 1200
Gly Asp Lys Asp Met Glu Thr Ile Arg Gln Tyr Met Val Glu Arg Gly
385 390 395 400

CTT GCA AAT GAC TAA 1215
Leu Ala Asn Asp



(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Asn Lys Gln Ile Gln Thr Glu Ala Asp Glu Leu Gly Phe Phe Gly
1 5 10 15

Glu Tyr Gly Gly Gln Tyr Val Pro Glu Thr Leu Met Pro Ala Ile Ile
20 25 30

Glu Leu Lys Lys Ala Tyr Lys Glu Ala Lys Ala Asp Pro Glu Phe Gln
35 40 45

Arg Glu Leu Glu Tyr Tyr Leu Ser Glu Tyr Val Gly Arg Ala Thr Pro
50 55 60

Leu Thr Tyr Ala Ala Ser Tyr Thr Glu Ser Leu Gly Gly Ala Lys Ile
65 70 75 80

Tyr Leu Lys Arg Glu Asp Leu Asn His Thr Gly Ala His Lys Ile Asn
85 90 95

Asn Ala Leu Gly Gln Ala Leu Leu Ala Lys Arg Met Gly Lys Lys Lys
100 105 110

Leu Val Ala Glu Thr Gly Ala Gly Gln His Gly Val Ala Ser Ala Thr
115 120 125

Val Ala Ala Leu Phe Asp Met Glu Leu Val Val Phe Met Gly Ser Glu
130 135 140

Asp Ile Lys Arg Gln Gln Leu Asn Val Phe Arg Met Glu Leu Leu Gly
145 150 155 160

Ala Lys Val Val Ala Val Glu Asp Gly Gln Gly Thr Leu Ser Asp Ala
165 170 175

Val Asn Lys Ala Leu Gln Tyr Trp Val Ser His Val Asp Asp Thr His
180 185 190

Tyr Leu Leu Gly Ser Ala Leu Gly Pro Asp Pro Phe Pro Thr Ile Val
195 200 205

Arg Asp Phe Gln Ser Val Ile Gly Lys Glu Ile Lys Ser Gln Ile Leu
210 215 220

Lys Lys Glu Gly Arg Leu Pro Asp Ala Ile Val Ala Cys Ile Gly Gly
225 230 235 240

Gly Ser Asn Ala Ile Gly Thr Phe Tyr Pro Phe Ile Lys Asp Asp Val
245 250 255

Ala Leu Tyr Gly Val Glu Ala Ala Gly Gln Gly Asp Asp Thr Asp Lys
260 265 270

His Ala Leu Ala Ile Gly Lys Gly Ser Pro Gly Val Leu His Gly Thr
275 280 285

Lys Met Tyr Leu Ile Gln Asp Glu Asp Gly Gln Val Gln Leu Ala His
290 295 300

Ser Ile Ser Ala Gly Leu Asp Tyr Pro Gly Ile Gly Pro Glu His Ser
305 310 315 320

Tyr Tyr His Asp Ile Gly Arg Val Thr Phe Glu Asn Ala Ser Asp Thr
325 330 335

Gln Ala Met Asn Ala Leu Ile Asn Phe Thr Lys His Glu Gly Ile Ile
340 345 350

Pro Ala Ile Glu Ser Ala His Ala Leu Ser Tyr Val Glu Arg Leu Ala
355 360 365

Pro Thr Met Ser Lys Glu Asp Ile Ile Val Val Thr Ile Ser Gly Arg
370 375 380

Gly Asp Lys Asp Met Glu Thr Ile Arg Gln Tyr Met Val Glu Arg Gly
385 390 395 400

Leu Ala Asn Asp

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10c34)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1005

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATG AAA ACC ATG ACA TTA CTA ACA AGA ATA AAA ACT GAA ACT ATA TTA 48
Met Lys Thr Met Thr Leu Leu Thr Arg Ile Lys Thr Glu Thr Ile Leu
1 5 10 15

CTT GAA AGC GAC ATT AAA GAG CTA ATC GAT ATA CTT ATT TCT CCT AGT 96
Leu Glu Ser Asp Ile Lys Glu Leu Ile Asp Ile Leu Ile Ser Pro Ser
20 25 30

ATT GGA ACT GAT ATT AAA TAT GAA TTA CTT AGT TCC TAT TCG GAG CGA 144
Ile Gly Thr Asp Ile Lys Tyr Glu Leu Leu Ser Ser Tyr Ser Glu Arg
35 40 45

GAA ATC CAA CAA CAA GAA TTA ACA TAT ATT GTA CGT AGC TTA ATT AAT 192
Glu Ile Gln Gln Gln Glu Leu Thr Tyr Ile Val Arg Ser Leu Ile Asn
50 55 60

ACA ATG TAT CCA CAT CAA CCA TGT TAT GAA GGG GCT ATG TGT GTG TGC 240
Thr Met Tyr Pro His Gln Pro Cys Tyr Glu Gly Ala Met Cys Val Cys
65 70 75 80

GGC ACA GGT GGT GAC AAG TCA AAT AGT TTC AAC ATT TCA ACG ACT GTT 288
Gly Thr Gly Gly Asp Lys Ser Asn Ser Phe Asn Ile Ser Thr Thr Val
85 90 95

GCT TTT GTT GTA GCA AGT GCT GGC GTA AAA GTT ATA AAA CAT GGT AAT 336
Ala Phe Val Val Ala Ser Ala Gly Val Lys Val Ile Lys His Gly Asn
100 105 110

AAA AGT ATT ACC TCA AAT TCA GGT AGT ACG GAT TTG TTA AAT CAA ATG 384
Lys Ser Ile Thr Ser Asn Ser Gly Ser Thr Asp Leu Leu Asn Gln Met
115 120 125

AAC ATA CAA ACA ACA ACT GTT GAT GAT ACA CCT AAC CAA TTA AAT GAn 432
Asn Ile Gln Thr Thr Thr Val Asp Asp Thr Pro Asn Gln Leu Asn Xaa
130 135 140

AAA GAC CTT GTA TTC ATT GGT GCA ACT GAA TCA TAT CCA ATC ATG AAG 480
Lys Asp Leu Val Phe Ile Gly Ala Thr Glu Ser Tyr Pro Ile Met Lys
145 150 155 160

TAT ATG CAA CCA GTT AGA AAA ATG ATT GGA AAG CCT ACA ATA TTA AAC 528
Tyr Met Gln Pro Val Arg Lys Met Ile Gly Lys Pro Thr Ile Leu Asn
165 170 175

CTT GTG GGT CCA TTA ATT AAT CCA TAT CAC TTA ACG TAT CAA ATG GTA 576
Leu Val Gly Pro Leu Ile Asn Pro Tyr His Leu Thr Tyr Gln Met Val
180 185 190

GGC GTC TTT GAT CCT ACA AAG TTA AAG TTA GTT GCT AAA ACG ATT AAA 624
Gly Val Phe Asp Pro Thr Lys Leu Lys Leu Val Ala Lys Thr Ile Lys
195 200 205

GAT TTA GGT AGA AAA CGT GCA ATC GTT TTA CAT GGT GCA AAT GGT ATG 672
Asp Leu Gly Arg Lys Arg Ala Ile Val Leu His Gly Ala Asn Gly Met
210 215 220

GAT GAA GCA ACA CTA TCT GGT GAT AAT TTG ATA TAT GAA TTG ACT GAA 720
Asp Glu Ala Thr Leu Ser Gly Asp Asn Leu Ile Tyr Glu Leu Thr Glu
225 230 235 240

GAT GGA GAA ATC AAA AAT TAC ACA TTA AAT GCG ACT GAT TAT GGT TTG 768
Asp Gly Glu Ile Lys Asn Tyr Thr Leu Asn Ala Thr Asp Tyr Gly Leu
245 250 255

AAA CAT GCG CCG AAT AGT GAT TTT AAA GGC GGT TCA CCT GAA GAA AAT 816
Lys His Ala Pro Asn Ser Asp Phe Lys Gly Gly Ser Pro Glu Glu Asn
260 265 270

TTA GCA ATC TCC CTT AAT ATC TTG AAT GGT AAA GAT CAG TCA rGT CGA 864
Leu Ala Ile Ser Leu Asn Ile Leu Asn Gly Lys Asp Gln Ser Xaa Arg
275 280 285

CGT GAT GTT GTC TTA CTA AAT GCG GGT TTA AGC CTT TAT GTT GCA GAG 912
Arg Asp Val Val Leu Leu Asn Ala Gly Leu Ser Leu Tyr Val Ala Glu
290 295 300



AAA rTG GAT ACC ATC GCA GAA GGC ATA GAA CTT GCA ACT ACA TTG ATT 960
Lys Xaa Asp Thr Ile Ala Glu Gly Ile Glu Leu Ala Thr Thr Leu Ile
305 310 315 320

GAT AAT GGT GAA GCA TTG GAA AAA TAC CAT CAA ATG AGA GGT GAA 1005
Asp Asn Gly Glu Ala Leu Glu Lys Tyr His Gln Met Arg Gly Glu
325 330 335 

TAA 1008 
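
The relationship between the 1008 base pair length, the CDS location 1..1005, and the 335-residue translation can be checked with a short script. The following is a minimal sketch, not part of the original listing: the function names `translate` and `coded_residues` are hypothetical, the standard genetic code is assumed, and any codon containing a symbol other than uppercase A, C, G or T (such as the `n` and `r` symbols in this sequence) is simply reported as X, corresponding to Xaa.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one-letter residues listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) codon by codon.

    Assumption for this sketch: a codon holding any symbol other than
    uppercase A, C, G, T is treated as unknown and emitted as "X".
    Trailing bases that do not fill a whole codon are ignored.
    """
    seq = "".join(dna.split())
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if any(base not in "ACGT" for base in codon):
            residues.append("X")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


def coded_residues(cds_start: int, cds_end: int) -> int:
    """Number of residues encoded by a CDS given 1-based inclusive bounds."""
    length = cds_end - cds_start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# CDS 1..1005 encodes 335 residues; the remaining 3 bases are the stop codon.
print(coded_residues(1, 1005))
# First four codons of SEQ ID NO: 35.
print(translate("ATG AAA ACC ATG"))
```

Under these assumptions, the codons `GAn`, `rGT` and `rTG` in the listing all translate to X, matching the Xaa entries at positions 144, 287 and 306.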



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Met Lys Thr Met Thr Leu Leu Thr Arg Ile Lys Thr Glu Thr Ile Leu
1 5 10 15

Leu Glu Ser Asp Ile Lys Glu Leu Ile Asp Ile Leu Ile Ser Pro Ser
20 25 30

Ile Gly Thr Asp Ile Lys Tyr Glu Leu Leu Ser Ser Tyr Ser Glu Arg
35 40 45

Glu Ile Gln Gln Gln Glu Leu Thr Tyr Ile Val Arg Ser Leu Ile Asn
50 55 60

Thr Met Tyr Pro His Gln Pro Cys Tyr Glu Gly Ala Met Cys Val Cys
65 70 75 80

Gly Thr Gly Gly Asp Lys Ser Asn Ser Phe Asn Ile Ser Thr Thr Val
85 90 95

Ala Phe Val Val Ala Ser Ala Gly Val Lys Val Ile Lys His Gly Asn
100 105 110

Lys Ser Ile Thr Ser Asn Ser Gly Ser Thr Asp Leu Leu Asn Gln Met
115 120 125

Asn Ile Gln Thr Thr Thr Val Asp Asp Thr Pro Asn Gln Leu Asn Xaa
130 135 140

Lys Asp Leu Val Phe Ile Gly Ala Thr Glu Ser Tyr Pro Ile Met Lys
145 150 155 160



Tyr Met Gln Pro Val Arg Lys Met Ile Gly Lys Pro Thr Ile Leu Asn
165 170 175

Leu Val Gly Pro Leu Ile Asn Pro Tyr His Leu Thr Tyr Gln Met Val
180 185 190

Gly Val Phe Asp Pro Thr Lys Leu Lys Leu Val Ala Lys Thr Ile Lys
195 200 205

Asp Leu Gly Arg Lys Arg Ala Ile Val Leu His Gly Ala Asn Gly Met
210 215 220

Asp Glu Ala Thr Leu Ser Gly Asp Asn Leu Ile Tyr Glu Leu Thr Glu
225 230 235 240

Asp Gly Glu Ile Lys Asn Tyr Thr Leu Asn Ala Thr Asp Tyr Gly Leu
245 250 255

Lys His Ala Pro Asn Ser Asp Phe Lys Gly Gly Ser Pro Glu Glu Asn
260 265 270

Leu Ala Ile Ser Leu Asn Ile Leu Asn Gly Lys Asp Gln Ser Xaa Arg
275 280 285

Arg Asp Val Val Leu Leu Asn Ala Gly Leu Ser Leu Tyr Val Ala Glu
290 295 300

Lys Xaa Asp Thr Ile Ala Glu Gly Ile Glu Leu Ala Thr Thr Leu Ile
305 310 315 320

Asp Asn Gly Glu Ala Leu Glu Lys Tyr His Gln Met Arg Gly Glu
325 330 335 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p4c27)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1251



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

ATG GCG GAA ATT AAG GTA CCT GAA TTA GCA GAA TCA ATC TCA GAA GGA 48
Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser Ile Ser Glu Gly
1 5 10 15

ACA ATA GCC CAA TGG TTA AAG CAG CCT GGT GAC TAT GTA GAA CAG GGT 96
Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr Val Glu Gln Gly
20 25 30

GAA TAT CTG CTT GAA CTA GAA ACG GAT AAA GTG AAT GTT GAA TTG ACA 144
Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val Glu Leu Thr
35 40 45

GCA GAA GAA TCG GGT GTA CTT CAA GAG GTA TTG AAA GAT TCG GGT GAT 192
Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser Gly Asp
50 55 60

ACC GTC CAG GTC GGA GAA ATT ATC GGT ACG ATT TCA GAA GGC GCG GGT 240
Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly
65 70 75 80

GAA AGT TCT GCC CCT GCT CCT ACA GAG AAA ACA GAA AGC AAG GAA AGC 288
Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser
85 90 95

GTA AAA GAA GAG AAA CAG GCT GAA CCA GCT GCA CAA GAG GTG AGC GAG 336
Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu
100 105 110 
GAA GCA CAA TCT GAA GCA AAA TCA AGA ACG ATC GCT TCT CCG TCG GCC 384
Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala
115 120 125

CGT AAG CTT GCG AGA GAA AAA GGA ATT GAC CTG TCT CAA GTT CCA ACT 432
Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr
130 135 140

GGA GAT CCG CTT GGA AGA GTG CGC AAG CAG GAT GTC GAA GCG TAC GAA 480
Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu
145 150 155 160

AAA CCG GCA TCA AAA CCG GCT CCT CAG CAA AAG CAG CAG CCT CAG GCT 528
Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala
165 170 175

CAA AAA GCA CAG CAA AGC TTT GAC AAA CCT GTT GAA GTG CAA AAA ATG 576
Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met
180 185 190

TCA CGC CGC AGA CAA ACG ATT GCA AAA CGC CTT GTA GAG GTA CAG CAA 624
Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln
195 200 205

ACA TCA GCG ATG CTG ACT ACA TTT AAT GAA GTG GAC ATG ACG GCT GTC 672
Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val
210 215 220

ATG AAT CTC AGA AAA CGC CGC AAA GAT CAA TTT TTT GAG CAA AAT GAA 720
Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu
225 230 235 240

GTG AAG CTC GGC TTT ATG TCT TTC TTC ACG AAA GCG GTC GTG GCT GCA 768
Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala
245 250 255

TTG AAA AAA TAT CCG CTG TTG AAT GCA GAA ATT CAA GGC GAT GAG TTG 816
Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu
260 265 270

ATC GTT AAA AAA TTC TAC GAT ATC GGA ATC GCT GTT GCT GCT GTA GAA 864
Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu
275 280 285

GGT CTT GTC GTT CCG GTT GTA CGG GAT GCG GAT CGC CTG ACA TTT GCA 912
Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr Phe Ala
290 295 300

GGA ATC GAA AAA GAG ATC GGC GAG CTT GCG AAA AAA GCA AGA AAC AAT 960
Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg Asn Asn
305 310 315 320

AAA TTA ACC CTT AGC GAG CTT GAG GGA GGC TCC TTC ACG ATT ACA AAC 1008
Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr Ile Thr Asn
325 330 335



GGA GGG ACT TTC GGT TCA TTG ATG TCA ACT CCA ATT TTA AAC AGC CCG 1056
Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu Asn Ser Pro
340 345 350

CAA GTC GGT ATA CTG GGC ATG CAT AAG ATT CAG CTG CGC CCT GTA GCC 1104
Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg Pro Val Ala
355 360 365

ATT GAT GAA GAG CGT TTC GAA AAC CGT CCG ATG ATG TAT ATC GCT TTA 1152
Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr Ile Ala Leu
370 375 380

TCT TAT GAT CAC CGA ATT GTA GAC GGT AAA GAA GCG GTT GGT TTC CTC 1200
Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val Gly Phe Leu
385 390 395 400

GTG ACA ATC AAA AAT TTA CTG GAA GAT CCT GAA CAG CTT TTA TTA GAA 1248
Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln Leu Leu Leu Glu
405 410 415

GGA TAA 1254
Gly



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser Ile Ser Glu Gly
1 5 10 15

Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr Val Glu Gln Gly
20 25 30

Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val Glu Leu Thr
35 40 45

Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser Gly Asp
50 55 60

Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly
65 70 75 80

Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser
85 90 95

Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu
100 105 110

Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala
115 120 125

Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr
130 135 140

Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu
145 150 155 160

Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala
165 170 175

Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met
180 185 190 
Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln
195 200 205

Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val
210 215 220

Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu
225 230 235 240

Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala
245 250 255

Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu
260 265 270

Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu
275 280 285

Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr Phe Ala
290 295 300

Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg Asn Asn
305 310 315 320

Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr Ile Thr Asn
325 330 335

Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu Asn Ser Pro
340 345 350

Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg Pro Val Ala
355 360 365

Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr Ile Ala Leu
370 375 380

Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val Gly Phe Leu
385 390 395 400

Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln Leu Leu Leu Glu
405 410 415

Gly 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p4c52)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1251

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

ATG GCG GAA ATT AAG GTA CCT GAA TTA GCA GAA TCA ATC TCA GAA GGA 48
Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser Ile Ser Glu Gly
1 5 10 15

ACA ATA GCC CAA TGG TTA AAG CAG CCT GGT GAC TAT GTA GAA CAG GGT 96
Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr Val Glu Gln Gly
20 25 30

GAA TAT CTG CTT GAA CTA GAA ACG GAT AAA GTG AAT GTT GAA TTG ACA 144
Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val Glu Leu Thr
35 40 45

GCA GAA GAA TCG GGT GTA CTT CAA GAG GTA TTG AAA GAT TCG GGT GAT 192
Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser Gly Asp
50 55 60

ACC GTC CAG GTC GGA GAA ATT ATC GGT ACG ATT TCA GAA GGC GCG GGT 240
Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly
65 70 75 80

GAA AGT TCT GCC CCT GCT CCT ACA GAG AAA ACA GAA AGC AAG GAA AGC 288
Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser
85 90 95

GTA AAA GAA GAG AAA CAG GCT GAA CCA GCT GCA CAA GAG GTG AGC GAG 336
Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu
100 105 110

GAA GCA CAA TCT GAA GCA AAA TCA AGA ACG ATC GCT TCT CCG TCG GCC 384
Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala
115 120 125

CGT AAG CTT GCG AGA GAA AAA GGA ATT GAC CTG TCT CAA GTT CCA ACT 432
Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr
130 135 140

GGA GAT CCG CTT GGA AGA GTG CGC AAG CAG GAT GTC GAA GCG TAC GAA 480
Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu
145 150 155 160

AAA CCG GCA TCA AAA CCG GCT CCT CAG CAA AAG CAG CAG CCT CAG GCT 528
Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala
165 170 175

CAA AAA GCA CAG CAA AGC TTT GAC AAA CCT GTT GAA GTG CAA AAA ATG 576
Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met
180 185 190

TCA CGC CGC AGA CAA ACG ATT GCA AAA CGC CTT GTA GAG GTA CAG CAA 624
Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln
195 200 205

ACA TCA GCG ATG CTG ACT ACA TTT AAT GAA GTG GAC ATG ACG GCT GTC 672
Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val
210 215 220

ATG AAT CTC AGA AAA CGC CGC AAA GAT CAA TTT TTT GAG CAA AAT GAA 720
Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu
225 230 235 240

GTG AAG CTC GGC TTT ATG TCT TTC TTC ACG AAA GCG GTC GTG GCT GCA 768
Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala
245 250 255
TTG AAA AAA TAT CCG CTG TTG AAT GCA GAA ATT CAA GGC GAT GAG TTG 816
Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu
260 265 270

ATC GTT AAA AAA TTC TAC GAT ATC GGA ATC GCT GTT GCT GCT GTA GAA 864
Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu
275 280 285

GGT CTT GTC GTT CCG GTT GTA CGG GAT GCG GAT CGC CTG ACA TTT GCA 912
Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr Phe Ala
290 295 300

GGA ATC GAA AAA GAG ATC GGC GAG CTT GCG AAA AAA GCA AGA AAC AAT 960
Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg Asn Asn
305 310 315 320

AAA TTA ACC CTT AGC GAG CTT GAG GGA GGC TCC TTC ACG ATT ACA AAC 1008
Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr Ile Thr Asn
325 330 335

GGA GGG ACT TTC GGT TCA TTG ATG TCA ACT CCA ATT TTA AAC AGC CCG 1056
Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu Asn Ser Pro
340 345 350

CAA GTC GGT ATA CTG GGC ATG CAT AAG ATT CAG CTG CGC CCT GTA GCC 1104
Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg Pro Val Ala
355 360 365

ATT GAT GAA GAG CGT TTC GAA AAC CGT CCG ATG ATG TAT ATC GCT TTA 1152
Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr Ile Ala Leu
370 375 380

TCT TAT GAT CAC CGA ATT GTA GAC GGT AAA GAA GCG GTT GGT TTC CTC 1200
Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val Gly Phe Leu
385 390 395 400

GTG ACA ATC AAA AAT TTA CTG GAA GAT CCT GAA CAG CTT TTA TTA GAA 1248
Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln Leu Leu Leu Glu
405 410 415

GGA TAA 1254
Gly



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser Ile Ser Glu Gly
1 5 10 15

Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr Val Glu Gln Gly
20 25 30

Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val Glu Leu Thr
35 40 45

Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser Gly Asp
50 55 60

Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly
65 70 75 80

Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser
85 90 95

Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu
100 105 110

Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala
115 120 125

Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr
130 135 140

Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu
145 150 155 160

Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala
165 170 175

Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met
180 185 190

Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln
195 200 205 

Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val 
210 215 220 

Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu
225 230 235 240

Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala
245 250 255

Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu
260 265 270

Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu
275 280 285

Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr Phe Ala
290 295 300

Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg Asn Asn
305 310 315 320

Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr Ile Thr Asn
325 330 335

Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu Asn Ser Pro
340 345 350

Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg Pro Val Ala
355 360 365

Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr Ile Ala Leu
370 375 380

Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val Gly Phe Leu
385 390 395 400

Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln Leu Leu Leu Glu
405 410 415

Gly 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b2)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1251



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

ATG GCG GAA ATT AAG GTA CCT GAA TTA GCA GAA TCA ATC TCA GAA GGA 48
Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser Ile Ser Glu Gly
1 5 10 15

ACA ATA GCC CAA TGG TTA AAG CAG CCT GGT GAC TAT GTA GAA CAG GGT 96
Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr Val Glu Gln Gly
20 25 30

GAA TAT CTG CTT GAA CTA GAA ACG GAT AAA GTG AAT GTT GAA TTG ACA 144
Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val Glu Leu Thr
35 40 45

GCA GAA GAA TCG GGT GTA CTT CAA GAG GTA TTG AAA GAT TCG GGT GAT 192
Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser Gly Asp
50 55 60

ACC GTC CAG GTC GGA GAA ATT ATC GGT ACG ATT TCA GAA GGC GCG GGT 240
Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly
65 70 75 80

GAA AGT TCT GCC CCT GCT CCT ACA GAG AAA ACA GAA AGC AAG GAA AGC 288
Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser
85 90 95

GTA AAA GAA GAG AAA CAG GCT GAA CCA GCT GCA CAA GAG GTG AGC GAG 336
Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu
100 105 110

GAA GCA CAA TCT GAA GCA AAA TCA AGA ACG ATC GCT TCT CCG TCG GCC 384
Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala
115 120 125

CGT AAG CTT GCG AGA GAA AAA GGA ATT GAC CTG TCT CAA GTT CCA ACT 432
Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr
130 135 140 



GGA GAT CCG CTT GGA AGA GTG CGC AAG CAG GAT GTC GAA GCG TAC GAA 480
Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu
145 150 155 160

AAA CCG GCA TCA AAA CCG GCT CCT CAG CAA AAG CAG CAG CCT CAG GCT 528
Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala
165 170 175

CAA AAA GCA CAG CAA AGC TTT GAC AAA CCT GTT GAA GTG CAA AAA ATG 576
Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met
180 185 190

TCA CGC CGC AGA CAA ACG ATT GCA AAA CGC CTT GTA GAG GTA CAG CAA 624
Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln
195 200 205

ACA TCA GCG ATG CTG ACT ACA TTT AAT GAA GTG GAC ATG ACG GCT GTC 672
Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val
210 215 220

ATG AAT CTC AGA AAA CGC CGC AAA GAT CAA TTT TTT GAG CAA AAT GAA 720
Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu
225 230 235 240

GTG AAG CTC GGC TTT ATG TCT TTC TTC ACG AAA GCG GTC GTG GCT GCA 768
Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala
245 250 255

TTG AAA AAA TAT CCG CTG TTG AAT GCA GAA ATT CAA GGC GAT GAG TTG 816
Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu
260 265 270

ATC GTT AAA AAA TTC TAC GAT ATC GGA ATC GCT GTT GCT GCT GTA GAA 864
Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu
275 280 285

GGT CTT GTC GTT CCG GTT GTA CGG GAT GCG GAT CGC CTG ACA TTT GCA 912
Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr Phe Ala
290 295 300

GGA ATC GAA AAA GAG ATC GGC GAG CTT GCG AAA AAA GCA AGA AAC AAT 960
Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg Asn Asn
305 310 315 320

AAA TTA ACC CTT AGC GAG CTT GAG GGA GGC TCC TTC ACG ATT ACA AAC 1008
Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr Ile Thr Asn
325 330 335

GGA GGG ACT TTC GGT TCA TTG ATG TCA ACT CCA ATT TTA AAC AGC CCG 1056
Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu Asn Ser Pro
340 345 350

CAA GTC GGT ATA CTG GGC ATG CAT AAG ATT CAG CTG CGC CCT GTA GCC 1104
Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg Pro Val Ala
355 360 365

ATT GAT GAA GAG CGT TTC GAA AAC CGT CCG ATG ATG TAT ATC GCT TTA 1152
Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr Ile Ala Leu
370 375 380

TCT TAT GAT CAC CGA ATT GTA GAC GGT AAA GAA GCG GTT GGT TTC CTC 1200
Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val Gly Phe Leu
385 390 395 400 



GTG ACA ATC AAA AAT TTA CTG GAA GAT CCT GAA CAG CTT TTA TTA GAA 1248
Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln Leu Leu Leu Glu
405 410 415

GGA TAA 1254
Gly



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Ala Glu Ile Lys Val Pro Glu Leu Ala Glu Ser Ile Ser Glu Gly
1 5 10 15

Thr Ile Ala Gln Trp Leu Lys Gln Pro Gly Asp Tyr Val Glu Gln Gly
20 25 30

Glu Tyr Leu Leu Glu Leu Glu Thr Asp Lys Val Asn Val Glu Leu Thr
35 40 45

Ala Glu Glu Ser Gly Val Leu Gln Glu Val Leu Lys Asp Ser Gly Asp
50 55 60

Thr Val Gln Val Gly Glu Ile Ile Gly Thr Ile Ser Glu Gly Ala Gly
65 70 75 80

Glu Ser Ser Ala Pro Ala Pro Thr Glu Lys Thr Glu Ser Lys Glu Ser
85 90 95

Val Lys Glu Glu Lys Gln Ala Glu Pro Ala Ala Gln Glu Val Ser Glu
100 105 110

Glu Ala Gln Ser Glu Ala Lys Ser Arg Thr Ile Ala Ser Pro Ser Ala
115 120 125

Arg Lys Leu Ala Arg Glu Lys Gly Ile Asp Leu Ser Gln Val Pro Thr
130 135 140

Gly Asp Pro Leu Gly Arg Val Arg Lys Gln Asp Val Glu Ala Tyr Glu
145 150 155 160

Lys Pro Ala Ser Lys Pro Ala Pro Gln Gln Lys Gln Gln Pro Gln Ala
165 170 175

Gln Lys Ala Gln Gln Ser Phe Asp Lys Pro Val Glu Val Gln Lys Met
180 185 190

Ser Arg Arg Arg Gln Thr Ile Ala Lys Arg Leu Val Glu Val Gln Gln
195 200 205

Thr Ser Ala Met Leu Thr Thr Phe Asn Glu Val Asp Met Thr Ala Val
210 215 220

Met Asn Leu Arg Lys Arg Arg Lys Asp Gln Phe Phe Glu Gln Asn Glu
225 230 235 240 

Val Lys Leu Gly Phe Met Ser Phe Phe Thr Lys Ala Val Val Ala Ala
245 250 255

Leu Lys Lys Tyr Pro Leu Leu Asn Ala Glu Ile Gln Gly Asp Glu Leu
260 265 270

Ile Val Lys Lys Phe Tyr Asp Ile Gly Ile Ala Val Ala Ala Val Glu
275 280 285

Gly Leu Val Val Pro Val Val Arg Asp Ala Asp Arg Leu Thr Phe Ala
290 295 300

Gly Ile Glu Lys Glu Ile Gly Glu Leu Ala Lys Lys Ala Arg Asn Asn
305 310 315 320

Lys Leu Thr Leu Ser Glu Leu Glu Gly Gly Ser Phe Thr Ile Thr Asn
325 330 335

Gly Gly Thr Phe Gly Ser Leu Met Ser Thr Pro Ile Leu Asn Ser Pro
340 345 350

Gln Val Gly Ile Leu Gly Met His Lys Ile Gln Leu Arg Pro Val Ala
355 360 365

Ile Asp Glu Glu Arg Phe Glu Asn Arg Pro Met Met Tyr Ile Ala Leu
370 375 380

Ser Tyr Asp His Arg Ile Val Asp Gly Lys Glu Ala Val Gly Phe Leu
385 390 395 400

Val Thr Ile Lys Asn Leu Leu Glu Asp Pro Glu Gln Leu Leu Leu Glu
405 410 415

Gly 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10c20)"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

TTTCGAAGAT CGGCTCTTAC AATATTGACG GTAAAGAACA GTTGGATTCT TAAAAACAAT 60 

TAAAGAATTA ATTGAAAACC CAAAGACTTA TTATAGAATC TTAATCCACA ACACAAAATA 120 

GTTTACTATT CCTAAAAGCG GGATTAAATC AATAACAAAC AGCATAAGAT TATTTCCTAG 180

TCGAAATATC TTACTGCTGT ACTTTATTTT TATAATGATC TTTAATGTGG GTTTAATTTT 240 

GACTACTTAA AAATATATCA TTTCTATTGA AATAGACTCA CAATACAAAT ATAGTAGAAT 300 

GCGTGTTTCA ATATGCTAAT GAATGCATTT TAGATATAAC AAACGAGAAA TATATGAATT 360 

CTATAAGCGC CTTTAGTTAA TTTAAATCTC TGAACATGAT GTAATTCGCT TTATGGACAC 420

CACATTATAT GTTTTCATGT CCTACAACAC AACACATATT CAATTGTATA TACAGATATT 480

CTTAATGACA CACTCATAGC CCCATAACAA TATATGTTAC CTAT 524

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p12c32)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2796 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

ATG ACT AAC GAA AGA AAA GAA GTT TCA GAG GCT CCT GTA AAC TTC GGT 48
Met Thr Asn Glu Arg Lys Glu Val Ser Glu Ala Pro Val Asn Phe Gly
1 5 10 15

GCG AAT TTA GGT CTA ATG TTA GAT CTA TAT GAT GAC TTT TTA CAA GAT 96
Ala Asn Leu Gly Leu Met Leu Asp Leu Tyr Asp Asp Phe Leu Gln Asp
20 25 30

CCA TCA TCT GTA CCA GAA GAT TTA CAA GTC TTA TTC AGC ACA ATT AAG 144
Pro Ser Ser Val Pro Glu Asp Leu Gln Val Leu Phe Ser Thr Ile Lys
35 40 45

AAT GAT GAC TCA ATT GTA CCA GCT TTA AAA AGT ACA AGT AGT CAA AAT 192
Asn Asp Asp Ser Ile Val Pro Ala Leu Lys Ser Thr Ser Ser Gln Asn
50 55 60

AGC GAC GGC ACA ATT AAG CGT GTC ATG CGT TTA ATT GAT AAT ATT CGC 240
Ser Asp Gly Thr Ile Lys Arg Val Met Arg Leu Ile Asp Asn Ile Arg
65 70 75 80

CAA TAC GGG CAT CTT AAA GCC GAT ATT TAT CCT GTA AAT CCT CCA AAA 288
Gln Tyr Gly His Leu Lys Ala Asp Ile Tyr Pro Val Asn Pro Pro Lys
85 90 95

AGG AAA CAT GTA CCT AAA TTA GAG ATT GAA GAC TTT GAT TTA GAT CAA 336
Arg Lys His Val Pro Lys Leu Glu Ile Glu Asp Phe Asp Leu Asp Gln
100 105 110

CAG ACT TTG GAA GGT ATA TCA GCA GGA ATT GTT TCA GAT CAC TTT GCC 384
Gln Thr Leu Glu Gly Ile Ser Ala Gly Ile Val Ser Asp His Phe Ala
115 120 125

GAC ATT TAT GAT AAT GCT TAT GAA GCA ATT TTA AGA ATG GAA AAA CGT 432
Asp Ile Tyr Asp Asn Ala Tyr Glu Ala Ile Leu Arg Met Glu Lys Arg
130 135 140

TAC AAA GGA CCA ATT GCA TTT GAG TAT ACA CAT ATT AAT AAC AAT ACC 480
Tyr Lys Gly Pro Ile Ala Phe Glu Tyr Thr His Ile Asn Asn Asn Thr
145 150 155 160 



GAA CGT GGT TGG TTA AAA AGA AGA ATT GAA ACG CCA TAT AAA GTA ACG 



528 
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Glu 


Arg Gly Trp 


Leu 


Lys 


Arg 


Arg 


He 


Glu 


Thr 


Pro 


Tyr 


Lys 


Val 


Thr 












165 










170 








175 






TTA 


AAT 


AAT 


AAC 


GAA 


AAA 


AGG 


GCA 


CTA 


TTC 


AAA 


CAA 


TTA 


GCG 


TAT 


GTT 


576 


Leu 


Asn 


Asn 


Asn 


Glu 


Lys 


Arg 


Ala 


Leu 


Phe 


Lys 


Gin 


Leu 


Ala 


Tyr 


Val 










180 










185 










190 






GAA 


GGG 


TTT 


GAA 


AAA 


TAT 


CTT 


CAT 


AAA 


AAC 


TTC 


GTT 


GGT 


GCA 


AAG 


CGT 


624 


Glu Gly Phe Glu 


Lys 


Tyr Leu 


HIS 


Lys 


Asn 


Pne 


Val 


Gly 


Ala 


Lys Arg 








195 










2 00 










205 










TTT 


TCA 


ATT 


GAA 


GGG 


GTA 


GAC 


GCA 


CTT 


GTA 


CCG 


ATG 


TTA 


CAA 


CGT 


ACT - 


672 


Phe 


Ser 


He 


Glu 


Gly Val Asp 


Ala 


Leu 


Val 


Pro 


Met 


Leu 


Gin 


Arg Thr 






210 










215 










220 












ATT 


ACG 


ATT 


GCT 


GCG 


AAA 


GAA 


GGT 


ATT 


AAA 


AAT 


ATA 


CAA 


ATA 


GGC 


ATG 


720 


lie 


Thr 


He 


Ala 


Ala 


Lys 


Glu 


Gly 


lie 


Lys 


Asn 


He 


Gin 


He 


Gly Met 




225 










230 










235 










240 




GCT 


CAC 


CGT 


GGA 


CGT 


TTA 


AAC 


GTT 


TTA 


ACG 


CAT 


GTC 


TTA 


GAA 


AAA 


CCG 


768 


Ala 


His 


Arg 


Gly 


Arg 


Leu 


Asn 


val 


Leu 


Thr 


His 


Val 


Leu 


Glu 


Lys 


Pro 












245 










250 










255 






TAC 


GAA 


ATG 


ATG 


ATT 


TCA 


GAA 


TTT 


ATG 


CAT 


ACA 


GAT 


CCA 


ATG 


AAA 


TTC 


816 


Tyr 


Glu 


Met 


Met 


He 


Ser 


Glu 


Phe 


Met 


His 


Thr 


Asp 


Pro 


Met 


Lys 


Phe 










260 










265 










270 








TTA 


CCT 


GAA 


GAT 


GGT 


AGC 


TTG 


CAG 


TTA 


ACT 


GCT 


GGA 


TGG 


ACT 


GGT 


GAT 


864 


Leu 


Pro Glu Asp 


Gly 


Ser 


Leu 


Gin 


Leu 


Thr 


Ala 


Gly 


Trp 


Thr 


Gly Asp 








275 










280 










285 










GTG 


AAA 


TAT 


CAC 


CTT 


GGT 


GGC 


ATT 


AAA 


ACT 


ACT 


GAT 


TCA 


TAC 


GGT 


ACA 


912 


Val 


Lys 


Tyr 


His 


Leu Gly Gly 


He 


Lys 


Thr 


Thr 


Asp 


Ser 


Tyr 


Gly Thr 






290 










295 










300 












ATG 


CAG 


CGT 


ATT 


GCA 


CTG 


GCT 


AAC 


AAT 


CCA 


AGT 


CAC 


TTG 


GAA 


ATT 


GTT 


960 


Met 


Gin 


Arg 


He 


Ala 


Leu 


Ala 


Asn 


Asn 


Pro 


Ser 


His 


Leu 


Glu 


He 


Val 




305 










310 










315 










320 




GCA 


CCT 


GTT 


GTT 


GAG 


GGG 


CGT 


ACG 


AGA 


GCA 


GCA 


CAA 


GAT 


GAT 


ACA 


CAA 


1008 


Ala 


Pro 


Val 


Val 


Glu Gly Arg 


Thr 


Arg 


Ala 


Ala 


Gin 


Asp 


Asp 


Thr 


Gin 












325 










330 










335 






CGA 


GCT 


GGG 


GCT 


CCG 


ACG 


ACT 


GAT 


CAT 


CAT 


AAA 


GCA 


ATG 


CCA 


ATT 


ATT 


1056 


Arg Ala Gly Ala 


Pro 


Thr 


Thr 


Asp 


His 


His 


Lys 


Ala 


Met 


Pro 


He 


He 










340 










345 










350 








ATA 


CAT 


GGC 


GAT 


GCT 


GCT 


TAT 


CCT 


GGT 


CAA 


GGA 


ATT 


AAC 


TTC 


GAA 


ACA 


1104 


He 


His 


Gly Asp 


Ala 


Ala 


Tyr 


Pro 


Gly 


Gin 


Gly 


He 


Asn 


Phe 


Glu 


Thr 








355 










360 










365 










ATG AAC TTA GGA AAC TTG AAA GGC TAT TCT ACG GGT GGT TCA TTG CAT 1152
Met Asn Leu Gly Asn Leu Lys Gly Tyr Ser Thr Gly Gly Ser Leu His
370 375 380












ATT ATT ACT AAC AAT AGA ATT GGA TTT ACT ACA GAA CCA ATT GAT GCA 1200
Ile Ile Thr Asn Asn Arg Ile Gly Phe Thr Thr Glu Pro Ile Asp Ala
385 390 395 400




CGT TCA ACA ACT TAT TCT ACA GAT GTG GCC AAA GGT TAT GAT GTG CCA 1248
Arg Ser Thr Thr Tyr Ser Thr Asp Val Ala Lys Gly Tyr Asp Val Pro
405 410 415



ATA TTC CAT GTC AAT GCA GAT GAC GTT GAA GCT ACT ATT GAA GCA ATT 1296
Ile Phe His Val Asn Ala Asp Asp Val Glu Ala Thr Ile Glu Ala Ile
420 425 430



GAT ATT GCA ATG GAA TTT AGA AAA GAG TTT CAT AAA GAC GTC GTT ATT 1344
Asp Ile Ala Met Glu Phe Arg Lys Glu Phe His Lys Asp Val Val Ile
435 440 445










GAT TTA GTA GGT TAT CGT CGT TTC GGA CAT AAC GAA ATG GAT GAA CCA 1392
Asp Leu Val Gly Tyr Arg Arg Phe Gly His Asn Glu Met Asp Glu Pro
450 455 460










TCA ATT ACT AAT CCA GTT CCT TAT CAG AAT ATT CGC AAA CAT GAC TCT 1440
Ser Ile Thr Asn Pro Val Pro Tyr Gln Asn Ile Arg Lys His Asp Ser
465 470 475 480




GTT GAA TAT GTG TTT GGT AAA AAG CTT GTT AAT GAA GGT GTC ATT TCA 1488
Val Glu Tyr Val Phe Gly Lys Lys Leu Val Asn Glu Gly Val Ile Ser
485 490 495






GAA GAT GAA ATG CAT TCA TTT ATA GAA CAA GTC CAA AAG GAA CTA AGA 1536
Glu Asp Glu Met His Ser Phe Ile Glu Gln Val Gln Lys Glu Leu Arg
500 505 510






CAA GCT CAT GAT AAA ATT AAT AAA GCT GAT AAA ATG GAT AAT CCA GAT 1584
Gln Ala His Asp Lys Ile Asn Lys Ala Asp Lys Met Asp Asn Pro Asp
515 520 525








ATG GAA AAG CCT GCA GAT CTT GCA TTA CCG TTA CAA GCA GAC GAA CAA 1632
Met Glu Lys Pro Ala Asp Leu Ala Leu Pro Leu Gln Ala Asp Glu Gln
530 535 540










TCA TTT ACT TTT GAT CAC TTG AAA GAA ATA AAT GAT GCA TTG TTA ACA 1680
Ser Phe Thr Phe Asp His Leu Lys Glu Ile Asn Asp Ala Leu Leu Thr
545 550 555 560




TAT CCG GAT GGC TTT AAC ATT TTG AAA AAG TTA AAC AAA GTT CTT GAG 1728
Tyr Pro Asp Gly Phe Asn Ile Leu Lys Lys Leu Asn Lys Val Leu Glu
565 570 575






AAG CGT CAT GAG CCG TTT AAT AAA GAA GAT GGT TTA GTT GAT TGG GCA 1776
Lys Arg His Glu Pro Phe Asn Lys Glu Asp Gly Leu Val Asp Trp Ala
580 585 590






CAA GCA GAA CAA CTT GCA TTT GCG ACA ATT TTA CAA GAT GGT ACA CCG 1824
Gln Ala Glu Gln Leu Ala Phe Ala Thr Ile Leu Gln Asp Gly Thr Pro
595 600 605










ATT CGC TTA ACT GGT CAA GAT AGT GAA CGT GGT ACA TTC AGT CAT AGG 1872
Ile Arg Leu Thr Gly Gln Asp Ser Glu Arg Gly Thr Phe Ser His Arg
610 615 620










CAT GCC GTG TTA CAT GAT GAG CAA ACA GGT GAA ACA TAT ACA CCT TTA 1920
His Ala Val Leu His Asp Glu Gln Thr Gly Glu Thr Tyr Thr Pro Leu
625 630 635 640




CAT CAT GTT CCT GAT CAA AAA GCG ACA TTT GAT ATA CAC AAT TCT CCG 1968
His His Val Pro Asp Gln Lys Ala Thr Phe Asp Ile His Asn Ser Pro
645 650 655






CTT TCA GAA GCA GCA GTA GTT GGT TTT GAA TAC GGC TAT AAT GTG GAA 2016
Leu Ser Glu Ala Ala Val Val Gly Phe Glu Tyr Gly Tyr Asn Val Glu
660 665 670








AAC AAA AAA AGC TTC AAT ATT TGG GAA GCA CAA TAT GGT GAT TTT GCA 2064
Asn Lys Lys Ser Phe Asn Ile Trp Glu Ala Gln Tyr Gly Asp Phe Ala
675 680 685

AAT ATG TCA CAA ATG ATT TTT GAC AAC TTC TTA TTC AGT TCT CGC TCA 2112
Asn Met Ser Gln Met Ile Phe Asp Asn Phe Leu Phe Ser Ser Arg Ser
690 695 700

AAA TGG GGA GAA CGT TCA GGA TTA ACA TTA TTC TTA CCT CAT GCA TAT 2160
Lys Trp Gly Glu Arg Ser Gly Leu Thr Leu Phe Leu Pro His Ala Tyr
705 710 715 720

GAG GGT CAA GGG CCT GAA CAT TCA TCA GCA AGA TTA GAG CGA TTT TTA 2208
Glu Gly Gln Gly Pro Glu His Ser Ser Ala Arg Leu Glu Arg Phe Leu
725 730 735

CAA TTA GCT GCT GAA AAT AAT TGC ACA GTT GTC AAC TTA TCT AGT TCA 2256
Gln Leu Ala Ala Glu Asn Asn Cys Thr Val Val Asn Leu Ser Ser Ser
740 745 750

AGT AAT TAT TTC CAC TTA TTG CGT GCA CAA GCG GCT AGT TTA GAT TCT 2304
Ser Asn Tyr Phe His Leu Leu Arg Ala Gln Ala Ala Ser Leu Asp Ser
755 760 765

GAA CAA ATG CGA CCA TTG GTT GTT ATG TCA CCA AAA AGC TTA CTG AGA 2352
Glu Gln Met Arg Pro Leu Val Val Met Ser Pro Lys Ser Leu Leu Arg
770 775 780

AAT AAA ACA GTT GCA AAA CCA ATT GAT GAA TTT ACT TCT GGT GGA TTT 2400
Asn Lys Thr Val Ala Lys Pro Ile Asp Glu Phe Thr Ser Gly Gly Phe
785 790 795 800

GAG CCA ATT TTG ACA GAA TCA TAT CAA GCG GAT AAG GTT ACA AAA GTT 2448
Glu Pro Ile Leu Thr Glu Ser Tyr Gln Ala Asp Lys Val Thr Lys Val
805 810 815

ATT TTG GCA ACT GGT AAA ATG TTC ATT GAT TTA AAA GAA GCA TTA GCT 2496
Ile Leu Ala Thr Gly Lys Met Phe Ile Asp Leu Lys Glu Ala Leu Ala
820 825 830

AAA AAT CCA GAC GAA TCA GTA TTA CTC GTT GCG ATT GAA AGA TTG TAT 2544
Lys Asn Pro Asp Glu Ser Val Leu Leu Val Ala Ile Glu Arg Leu Tyr
835 840 845

CCA TTC CCA GAG GAA GAG ATT GAA GCA TTA CTA GCA CAA TTG CCA AAC 2592
Pro Phe Pro Glu Glu Glu Ile Glu Ala Leu Leu Ala Gln Leu Pro Asn
850 855 860

CTT GAA GAA GTG TCA TGG GTA CAA GAA GAA CCT AAA AAT CAA GGT GCA 2640
Leu Glu Glu Val Ser Trp Val Gln Glu Glu Pro Lys Asn Gln Gly Ala
865 870 875 880

TGG TTA TAT GTC TAT CCA TAT GTT AAA GTG CTA GTT GCA GAT AAA TAT 2688
Trp Leu Tyr Val Tyr Pro Tyr Val Lys Val Leu Val Ala Asp Lys Tyr
885 890 895

GAT TTA AGT TAT CAT GGC AGA ATT CAA AGG GCT GCT CCA GCT GAA GGC 2736
Asp Leu Ser Tyr His Gly Arg Ile Gln Arg Ala Ala Pro Ala Glu Gly
900 905 910

GAT GGA GAA ATT CAT AAA CTT GTT CAA AAT AAA ATT ATA GAA AAT GCA 2784
Asp Gly Glu Ile His Lys Leu Val Gln Asn Lys Ile Ile Glu Asn Ala
915 920 925

TTA AAA AAT AAC TAG 2799
Leu Lys Asn Asn
930

(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 932 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:45: 

Met Thr Asn Glu Arg Lys Glu Val Ser Glu Ala Pro Val Asn Phe Gly
1 5 10 15

Ala Asn Leu Gly Leu Met Leu Asp Leu Tyr Asp Asp Phe Leu Gln Asp
20 25 30

Pro Ser Ser Val Pro Glu Asp Leu Gln Val Leu Phe Ser Thr Ile Lys
35 40 45

Asn Asp Asp Ser Ile Val Pro Ala Leu Lys Ser Thr Ser Ser Gln Asn
50 55 60

Ser Asp Gly Thr Ile Lys Arg Val Met Arg Leu Ile Asp Asn Ile Arg
65 70 75 80

Gln Tyr Gly His Leu Lys Ala Asp Ile Tyr Pro Val Asn Pro Pro Lys
85 90 95

Arg Lys His Val Pro Lys Leu Glu Ile Glu Asp Phe Asp Leu Asp Gln
100 105 110

Gln Thr Leu Glu Gly Ile Ser Ala Gly Ile Val Ser Asp His Phe Ala
115 120 125

Asp Ile Tyr Asp Asn Ala Tyr Glu Ala Ile Leu Arg Met Glu Lys Arg
130 135 140

Tyr Lys Gly Pro Ile Ala Phe Glu Tyr Thr His Ile Asn Asn Asn Thr
145 150 155 160

Glu Arg Gly Trp Leu Lys Arg Arg Ile Glu Thr Pro Tyr Lys Val Thr
165 170 175

Leu Asn Asn Asn Glu Lys Arg Ala Leu Phe Lys Gln Leu Ala Tyr Val
180 185 190

Glu Gly Phe Glu Lys Tyr Leu His Lys Asn Phe Val Gly Ala Lys Arg
195 200 205

Phe Ser Ile Glu Gly Val Asp Ala Leu Val Pro Met Leu Gln Arg Thr
210 215 220

Ile Thr Ile Ala Ala Lys Glu Gly Ile Lys Asn Ile Gln Ile Gly Met
225 230 235 240

Ala His Arg Gly Arg Leu Asn Val Leu Thr His Val Leu Glu Lys Pro
245 250 255

Tyr Glu Met Met Ile Ser Glu Phe Met His Thr Asp Pro Met Lys Phe
260 265 270

Leu Pro Glu Asp Gly Ser Leu Gln Leu Thr Ala Gly Trp Thr Gly Asp
275 280 285

Val Lys Tyr His Leu Gly Gly Ile Lys Thr Thr Asp Ser Tyr Gly Thr
290 295 300

Met Gln Arg Ile Ala Leu Ala Asn Asn Pro Ser His Leu Glu Ile Val
305 310 315 320

Ala Pro Val Val Glu Gly Arg Thr Arg Ala Ala Gln Asp Asp Thr Gln
325 330 335

Arg Ala Gly Ala Pro Thr Thr Asp His His Lys Ala Met Pro Ile Ile
340 345 350

Ile His Gly Asp Ala Ala Tyr Pro Gly Gln Gly Ile Asn Phe Glu Thr
355 360 365

Met Asn Leu Gly Asn Leu Lys Gly Tyr Ser Thr Gly Gly Ser Leu His
370 375 380

Ile Ile Thr Asn Asn Arg Ile Gly Phe Thr Thr Glu Pro Ile Asp Ala
385 390 395 400

Arg Ser Thr Thr Tyr Ser Thr Asp Val Ala Lys Gly Tyr Asp Val Pro
405 410 415

Ile Phe His Val Asn Ala Asp Asp Val Glu Ala Thr Ile Glu Ala Ile
420 425 430

Asp Ile Ala Met Glu Phe Arg Lys Glu Phe His Lys Asp Val Val Ile
435 440 445

Asp Leu Val Gly Tyr Arg Arg Phe Gly His Asn Glu Met Asp Glu Pro
450 455 460

Ser Ile Thr Asn Pro Val Pro Tyr Gln Asn Ile Arg Lys His Asp Ser
465 470 475 480

Val Glu Tyr Val Phe Gly Lys Lys Leu Val Asn Glu Gly Val Ile Ser
485 490 495

Glu Asp Glu Met His Ser Phe Ile Glu Gln Val Gln Lys Glu Leu Arg
500 505 510

Gln Ala His Asp Lys Ile Asn Lys Ala Asp Lys Met Asp Asn Pro Asp
515 520 525

Met Glu Lys Pro Ala Asp Leu Ala Leu Pro Leu Gln Ala Asp Glu Gln
530 535 540

Ser Phe Thr Phe Asp His Leu Lys Glu Ile Asn Asp Ala Leu Leu Thr
545 550 555 560

Tyr Pro Asp Gly Phe Asn Ile Leu Lys Lys Leu Asn Lys Val Leu Glu
565 570 575

Lys Arg His Glu Pro Phe Asn Lys Glu Asp Gly Leu Val Asp Trp Ala
580 585 590

Gln Ala Glu Gln Leu Ala Phe Ala Thr Ile Leu Gln Asp Gly Thr Pro
595 600 605

Ile Arg Leu Thr Gly Gln Asp Ser Glu Arg Gly Thr Phe Ser His Arg
610 615 620

His Ala Val Leu His Asp Glu Gln Thr Gly Glu Thr Tyr Thr Pro Leu
625 630 635 640

His His Val Pro Asp Gln Lys Ala Thr Phe Asp Ile His Asn Ser Pro
645 650 655

Leu Ser Glu Ala Ala Val Val Gly Phe Glu Tyr Gly Tyr Asn Val Glu
660 665 670

Asn Lys Lys Ser Phe Asn Ile Trp Glu Ala Gln Tyr Gly Asp Phe Ala
675 680 685

Asn Met Ser Gln Met Ile Phe Asp Asn Phe Leu Phe Ser Ser Arg Ser
690 695 700

Lys Trp Gly Glu Arg Ser Gly Leu Thr Leu Phe Leu Pro His Ala Tyr
705 710 715 720

Glu Gly Gln Gly Pro Glu His Ser Ser Ala Arg Leu Glu Arg Phe Leu
725 730 735

Gln Leu Ala Ala Glu Asn Asn Cys Thr Val Val Asn Leu Ser Ser Ser
740 745 750

Ser Asn Tyr Phe His Leu Leu Arg Ala Gln Ala Ala Ser Leu Asp Ser
755 760 765

Glu Gln Met Arg Pro Leu Val Val Met Ser Pro Lys Ser Leu Leu Arg
770 775 780

Asn Lys Thr Val Ala Lys Pro Ile Asp Glu Phe Thr Ser Gly Gly Phe
785 790 795 800

Glu Pro Ile Leu Thr Glu Ser Tyr Gln Ala Asp Lys Val Thr Lys Val
805 810 815

Ile Leu Ala Thr Gly Lys Met Phe Ile Asp Leu Lys Glu Ala Leu Ala
820 825 830

Lys Asn Pro Asp Glu Ser Val Leu Leu Val Ala Ile Glu Arg Leu Tyr
835 840 845

Pro Phe Pro Glu Glu Glu Ile Glu Ala Leu Leu Ala Gln Leu Pro Asn
850 855 860

Leu Glu Glu Val Ser Trp Val Gln Glu Glu Pro Lys Asn Gln Gly Ala
865 870 875 880

Trp Leu Tyr Val Tyr Pro Tyr Val Lys Val Leu Val Ala Asp Lys Tyr
885 890 895

Asp Leu Ser Tyr His Gly Arg Ile Gln Arg Ala Ala Pro Ala Glu Gly
900 905 910

Asp Gly Glu Ile His Lys Leu Val Gln Asn Lys Ile Ile Glu Asn Ala
915 920 925

Leu Lys Asn Asn
930

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b30)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2571

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:

ATG CGT CTG GAT CGT CTT ACT AAT AAA TTC CAG CTT GCT CTT GCC GAT 48
Met Arg Leu Asp Arg Leu Thr Asn Lys Phe Gln Leu Ala Leu Ala Asp
1 5 10 15

GCC CAA TCA CTT GCA CTC GGG CAC GAC AAC CAA TTT ATC GAA CCA CTT 96
Ala Gln Ser Leu Ala Leu Gly His Asp Asn Gln Phe Ile Glu Pro Leu
20 25 30

CAT TTA ATG AGC GCC CTG CTG AAT CAG GAA GGG GGT TCG GTT AGT CCT 144
His Leu Met Ser Ala Leu Leu Asn Gln Glu Gly Gly Ser Val Ser Pro
35 40 45

TTA TTA ACA TCC GCT GGC ATA AAT GCT GGC CAG TTG CGC ACA GAT ATC 192
Leu Leu Thr Ser Ala Gly Ile Asn Ala Gly Gln Leu Arg Thr Asp Ile
50 55 60

AAT CAG GCA TTA AAT CGT TTA CCG CAG GTT GAA GGT ACT GGT GGT GAT 240
Asn Gln Ala Leu Asn Arg Leu Pro Gln Val Glu Gly Thr Gly Gly Asp
65 70 75 80

GTC CAG CCA TCA CAG GAT CTG GTG CGC GTT CTT AAT CTT TGC GAC AAC 288
Val Gln Pro Ser Gln Asp Leu Val Arg Val Leu Asn Leu Cys Asp Asn
85 90 95

GTG GCG CAA AAA CGT GGT GAT AAC TTT ATC TCG TCA GAA CTG TTC GTT 336
Val Ala Gln Lys Arg Gly Asp Asn Phe Ile Ser Ser Glu Leu Phe Val
100 105 110

CTG GCG GCA CTT GAG TCT CGC GGC ACC GTG GCC GAC ATC CTG AAA GCA 384
Leu Ala Ala Leu Glu Ser Arg Gly Thr Val Ala Asp Ile Leu Lys Ala
115 120 125

GCA GGG GCG ACC ACC GCC AAC ATT ACT CAA GCG ATT GAA CAA ATG CGT 432
Ala Gly Ala Thr Thr Ala Asn Ile Thr Gln Ala Ile Glu Gln Met Arg
130 135 140

GGA GGT GAA AGC GTG AAC GAT CAA GGT GCT GAA GAC CAA CGT CAG GCT 480
Gly Gly Glu Ser Val Asn Asp Gln Gly Ala Glu Asp Gln Arg Gln Ala
145 150 155 160

TTG AAA AAA TAT ACC ATC GAC CTT ACC GAA CGA GCC GAA CAG GGC AAA 528
Leu Lys Lys Tyr Thr Ile Asp Leu Thr Glu Arg Ala Glu Gln Gly Lys
165 170 175

CTC GAT CCG GTG ATT GGT CGT GAT GAA GAA ATT CGC CGT ACC ATT CAG 576
Leu Asp Pro Val Ile Gly Arg Asp Glu Glu Ile Arg Arg Thr Ile Gln
180 185 190

GTG CTG CAA CGT CGT ACT AAA AAT AAC CCG GTA CTG ATT GGT GAA CCC 624
Val Leu Gln Arg Arg Thr Lys Asn Asn Pro Val Leu Ile Gly Glu Pro
195 200 205

GGC GTC GGT AAA ACT GCC ATC GTT GAA GGT CTG GCG CAG CGT ATT ATC 672
Gly Val Gly Lys Thr Ala Ile Val Glu Gly Leu Ala Gln Arg Ile Ile
210 215 220

WO 99/01473 PCT/GB98/01974

AAC GGC GAA GTG CCG GAA GGG TTG AAA GGC CGC CGG GTA CTG GCG CTG 720
Asn Gly Glu Val Pro Glu Gly Leu Lys Gly Arg Arg Val Leu Ala Leu
225 230 235 240

GAT ATG GGC GCG CTG GTG GCT GGG GCG AAA TAT CGC GGT GAG TTT GAA 768
Asp Met Gly Ala Leu Val Ala Gly Ala Lys Tyr Arg Gly Glu Phe Glu
245 250 255

GAA CGT TTA AAA GGC GTG CTT AAC GAT CTT GCC AAA CAG GAA GGC AAC 816
Glu Arg Leu Lys Gly Val Leu Asn Asp Leu Ala Lys Gln Glu Gly Asn
260 265 270

GTC ATC CTA TTT ATC GAC GAA TTA CAT ACC ATG GTC GGC GCG GGT AAA 864
Val Ile Leu Phe Ile Asp Glu Leu His Thr Met Val Gly Ala Gly Lys
275 280 285

GCC GAT GGC GCA ATG GAC GCC GGA AAC ATG CTG AAA CCG GCG CTG GCG 912
Ala Asp Gly Ala Met Asp Ala Gly Asn Met Leu Lys Pro Ala Leu Ala
290 295 300

CGT GGT GAA TTG CAC TGC GTA GGT GCC ACG ACG CTT GAC GAA TAT CGC 960
Arg Gly Glu Leu His Cys Val Gly Ala Thr Thr Leu Asp Glu Tyr Arg
305 310 315 320

CAG TAC ATT GAA AAA GAT GCT GCG CTG GAA CGT CGT TTC CAG AAA GTG 1008
Gln Tyr Ile Glu Lys Asp Ala Ala Leu Glu Arg Arg Phe Gln Lys Val
325 330 335

TTT GTT GCC GAG CCT TCT GTT GAA GAT ACC ATT GCG ATT CTG CGT GGC 1056
Phe Val Ala Glu Pro Ser Val Glu Asp Thr Ile Ala Ile Leu Arg Gly
340 345 350

CTG AAA GAA CGT TAC GAA TTG CAC CAC CAT GTG CAA ATT ACT GAC CCG 1104
Leu Lys Glu Arg Tyr Glu Leu His His His Val Gln Ile Thr Asp Pro
355 360 365

GCA ATT GTT GCA GCG GCG ACG TTG TCT CAT CGC TAC ATT GCT GAC CGT 1152
Ala Ile Val Ala Ala Ala Thr Leu Ser His Arg Tyr Ile Ala Asp Arg
370 375 380

CAG CTG CCG GAT AAA GCC ATC GAC CTG ATC GAT GAA GCA GCA TCC AGC 1200
Gln Leu Pro Asp Lys Ala Ile Asp Leu Ile Asp Glu Ala Ala Ser Ser
385 390 395 400

ATT CGT ATG CAG ATT GAC TCA AAA CCA GAA GAA CTC GAC CGA CTC GAT 1248
Ile Arg Met Gln Ile Asp Ser Lys Pro Glu Glu Leu Asp Arg Leu Asp
405 410 415

CGT CGT ATC ATC CAG CTC AAA CTG GAA CAA CAG GCG TTA ATG AAA GAG 1296
Arg Arg Ile Ile Gln Leu Lys Leu Glu Gln Gln Ala Leu Met Lys Glu
420 425 430

TCT GAT GAA GCC AGT AAA AAA CGT CTG GAT ATG CTC AAC GAA GAA CTG 1344
Ser Asp Glu Ala Ser Lys Lys Arg Leu Asp Met Leu Asn Glu Glu Leu
435 440 445

AGC GAC AAA GAA CGT CAG TAC TCC GAG TTA GAA GAA GAG TGG AAA GCA 1392
Ser Asp Lys Glu Arg Gln Tyr Ser Glu Leu Glu Glu Glu Trp Lys Ala
450 455 460

GAG AAG GCA TCG CTT TCT GGT ACG CAG ACC ATT AAA GCG GAA CTG GAA 1440
Glu Lys Ala Ser Leu Ser Gly Thr Gln Thr Ile Lys Ala Glu Leu Glu
465 470 475 480

CAG GCG AAA ATC GCT ATT GAA CAG GCT CGC CGT GTG GGG GAC CTG GCG 1488
Gln Ala Lys Ile Ala Ile Glu Gln Ala Arg Arg Val Gly Asp Leu Ala
485 490 495

CGG ATG TCT GAA CTG CAA TAC GGC AAA ATC CCG GAA CTG GAA AAG CAA 1536
Arg Met Ser Glu Leu Gln Tyr Gly Lys Ile Pro Glu Leu Glu Lys Gln
500 505 510

CTG GAA GCC GCA ACG CAG CTC GAA GGC AAA ACT ATG CGT CTG TTG CGT 1584
Leu Glu Ala Ala Thr Gln Leu Glu Gly Lys Thr Met Arg Leu Leu Arg
515 520 525

AAT AAA GTG ACC GAC GCC GAA ATT GCT GAA GTG CTG GCG CGT TGG ACG 1632
Asn Lys Val Thr Asp Ala Glu Ile Ala Glu Val Leu Ala Arg Trp Thr
530 535 540

GGG ATT CCG GTT TCT CGC ATG ATG GAA AGC GAG CGC GAA AAA CTG CTG 1680
Gly Ile Pro Val Ser Arg Met Met Glu Ser Glu Arg Glu Lys Leu Leu
545 550 555 560

CGT ATG GAG CAA GAA CTG CAC CAT CGC GTA ATT GGT CAG AAC GAA GCG 1728
Arg Met Glu Gln Glu Leu His His Arg Val Ile Gly Gln Asn Glu Ala
565 570 575

GTT GAT GCG GTA TCT AAC GCT ATT CGT CGT AGC CGT GCG GGG CTG GCG 1776
Val Asp Ala Val Ser Asn Ala Ile Arg Arg Ser Arg Ala Gly Leu Ala
580 585 590

GAT CCA AAT CGC CCG ATT GGT TCA TTC CTG TTC CTC GGC CCA ACT GGT 1824
Asp Pro Asn Arg Pro Ile Gly Ser Phe Leu Phe Leu Gly Pro Thr Gly
595 600 605

GTG GGG AAA ACA GAG CTT TGT AAG GCG CTG GCG AAC TTT ATG TTT GAT 1872
Val Gly Lys Thr Glu Leu Cys Lys Ala Leu Ala Asn Phe Met Phe Asp
610 615 620

AGC GAC GAG GCG ATG GTC CGT ATC GAT ATG TCC GAG TTT ATG GAG AAA 1920
Ser Asp Glu Ala Met Val Arg Ile Asp Met Ser Glu Phe Met Glu Lys
625 630 635 640

CAC TCG GTG TCT CGT TTG GTT GGT GCG CCT CCG GGA TAT GTC GGT TAT 1968
His Ser Val Ser Arg Leu Val Gly Ala Pro Pro Gly Tyr Val Gly Tyr
645 650 655

GAA GAA GGT GGC TAC CTG ACC GAA GCG GTG CGT CGT CGT CCG TAT TCC 2016
Glu Glu Gly Gly Tyr Leu Thr Glu Ala Val Arg Arg Arg Pro Tyr Ser
660 665 670

GTC ATC CTG CTG GAT GAA GTG GAA AAA GCG CAT CCG GAT GTC TTC AAC 2064
Val Ile Leu Leu Asp Glu Val Glu Lys Ala His Pro Asp Val Phe Asn
675 680 685

ATT CTG TTG CAG GTA CTG GAT GAT GGG CGT CTG ACT GAC GGG CAA GGG 2112
Ile Leu Leu Gln Val Leu Asp Asp Gly Arg Leu Thr Asp Gly Gln Gly
690 695 700

AGA ACG GTC GAC TTC CGT AAT ACG GTC GTC ATT ATG ACC TCT AAC CTC 2160
Arg Thr Val Asp Phe Arg Asn Thr Val Val Ile Met Thr Ser Asn Leu
705 710 715 720

GGT TCC GAT CTG ATT CAG GAA CGC TTC GGT GAA CTG GAT TAT GCG CAC 2208
Gly Ser Asp Leu Ile Gln Glu Arg Phe Gly Glu Leu Asp Tyr Ala His
725 730 735

ATG AAA GAG CTG GTG CTC GGT GTG GTA AGC CAT AAC TTC CGT CCG GAA 2256
Met Lys Glu Leu Val Leu Gly Val Val Ser His Asn Phe Arg Pro Glu
740 745 750

TTC ATT AAC CGT ATC GAT GAA GTG GTG GTC TTC CAT CCG CTG GGT GAA 2304
Phe Ile Asn Arg Ile Asp Glu Val Val Val Phe His Pro Leu Gly Glu
755 760 765

CAG CAC ATT GCC TCG ATT GCG CAG ATT CAG TTG AAA CGT CTG TAC AAA 2352
Gln His Ile Ala Ser Ile Ala Gln Ile Gln Leu Lys Arg Leu Tyr Lys
770 775 780

CGT CTG GAA GAA CGT GGT TAT GAA ATC CAC ATT TCT GAC GAG GCG CTG 2400
Arg Leu Glu Glu Arg Gly Tyr Glu Ile His Ile Ser Asp Glu Ala Leu
785 790 795 800

AAA CTG CTG AGC GAG AAC GGT TAC GAT CCG GTC TAT GGT GCA CGT CCT 2448
Lys Leu Leu Ser Glu Asn Gly Tyr Asp Pro Val Tyr Gly Ala Arg Pro
805 810 815

CTG AAA CGT GCA ATT CAG CAG CAG ATC GAA AAC CCG CTG GCA CAG CAA 2496
Leu Lys Arg Ala Ile Gln Gln Gln Ile Glu Asn Pro Leu Ala Gln Gln
820 825 830

ATA CTG TCT GGT GAA TTG GTT CCG GGT AAA GTG ATT CGC CTG GAA GTT 2544
Ile Leu Ser Gly Glu Leu Val Pro Gly Lys Val Ile Arg Leu Glu Val
835 840 845

AAT GAA GAC CGG ATT GTC GCC GTC CAG TAA 2574
Asn Glu Asp Arg Ile Val Ala Val Gln
850 855
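
The codon rows and residue rows of a listing such as SEQ ID NO:46 can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative and do not come from the specification. It translates the final codon row above and confirms the length arithmetic (857 residues, CDS 1..2571, plus one stop codon giving 2574 bases).

```python
# Standard genetic code in the conventional T, C, A, G ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a coding strand into three-letter residues, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between codons
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues


# Final codon row of SEQ ID NO:46 as printed in the listing.
last_row = "AAT GAA GAC CGG ATT GTC GCC GTC CAG TAA"
print(" ".join(translate(last_row)))

# Length check: 857 residues occupy the CDS 1..2571; the stop codon brings the total to 2574.
cds_length = 857 * 3
total_length = cds_length + 3
print(cds_length, total_length)
```

Running the same function over every codon row and comparing against the printed residue row is one way to catch transcription errors in either line.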

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 857 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Arg Leu Asp Arg Leu Thr Asn Lys Phe Gln Leu Ala Leu Ala Asp
1 5 10 15

Ala Gln Ser Leu Ala Leu Gly His Asp Asn Gln Phe Ile Glu Pro Leu
20 25 30

His Leu Met Ser Ala Leu Leu Asn Gln Glu Gly Gly Ser Val Ser Pro
35 40 45

Leu Leu Thr Ser Ala Gly Ile Asn Ala Gly Gln Leu Arg Thr Asp Ile
50 55 60

Asn Gln Ala Leu Asn Arg Leu Pro Gln Val Glu Gly Thr Gly Gly Asp
65 70 75 80

Val Gln Pro Ser Gln Asp Leu Val Arg Val Leu Asn Leu Cys Asp Asn
85 90 95

Val Ala Gln Lys Arg Gly Asp Asn Phe Ile Ser Ser Glu Leu Phe Val
100 105 110

Leu Ala Ala Leu Glu Ser Arg Gly Thr Val Ala Asp Ile Leu Lys Ala
115 120 125

Ala Gly Ala Thr Thr Ala Asn Ile Thr Gln Ala Ile Glu Gln Met Arg
130 135 140

Gly Gly Glu Ser Val Asn Asp Gln Gly Ala Glu Asp Gln Arg Gln Ala
145 150 155 160

Leu Lys Lys Tyr Thr Ile Asp Leu Thr Glu Arg Ala Glu Gln Gly Lys
165 170 175

Leu Asp Pro Val Ile Gly Arg Asp Glu Glu Ile Arg Arg Thr Ile Gln
180 185 190

Val Leu Gln Arg Arg Thr Lys Asn Asn Pro Val Leu Ile Gly Glu Pro
195 200 205

Gly Val Gly Lys Thr Ala Ile Val Glu Gly Leu Ala Gln Arg Ile Ile
210 215 220

Asn Gly Glu Val Pro Glu Gly Leu Lys Gly Arg Arg Val Leu Ala Leu
225 230 235 240

Asp Met Gly Ala Leu Val Ala Gly Ala Lys Tyr Arg Gly Glu Phe Glu
245 250 255

Glu Arg Leu Lys Gly Val Leu Asn Asp Leu Ala Lys Gln Glu Gly Asn
260 265 270

Val Ile Leu Phe Ile Asp Glu Leu His Thr Met Val Gly Ala Gly Lys
275 280 285

Ala Asp Gly Ala Met Asp Ala Gly Asn Met Leu Lys Pro Ala Leu Ala
290 295 300

Arg Gly Glu Leu His Cys Val Gly Ala Thr Thr Leu Asp Glu Tyr Arg
305 310 315 320

Gln Tyr Ile Glu Lys Asp Ala Ala Leu Glu Arg Arg Phe Gln Lys Val
325 330 335

Phe Val Ala Glu Pro Ser Val Glu Asp Thr Ile Ala Ile Leu Arg Gly
340 345 350

Leu Lys Glu Arg Tyr Glu Leu His His His Val Gln Ile Thr Asp Pro
355 360 365

Ala Ile Val Ala Ala Ala Thr Leu Ser His Arg Tyr Ile Ala Asp Arg
370 375 380

Gln Leu Pro Asp Lys Ala Ile Asp Leu Ile Asp Glu Ala Ala Ser Ser
385 390 395 400

Ile Arg Met Gln Ile Asp Ser Lys Pro Glu Glu Leu Asp Arg Leu Asp
405 410 415

Arg Arg Ile Ile Gln Leu Lys Leu Glu Gln Gln Ala Leu Met Lys Glu
420 425 430

Ser Asp Glu Ala Ser Lys Lys Arg Leu Asp Met Leu Asn Glu Glu Leu
435 440 445

Ser Asp Lys Glu Arg Gln Tyr Ser Glu Leu Glu Glu Glu Trp Lys Ala
450 455 460

Glu Lys Ala Ser Leu Ser Gly Thr Gln Thr Ile Lys Ala Glu Leu Glu
465 470 475 480

Gln Ala Lys Ile Ala Ile Glu Gln Ala Arg Arg Val Gly Asp Leu Ala
485 490 495

Arg Met Ser Glu Leu Gln Tyr Gly Lys Ile Pro Glu Leu Glu Lys Gln
500 505 510

Leu Glu Ala Ala Thr Gln Leu Glu Gly Lys Thr Met Arg Leu Leu Arg
515 520 525

Asn Lys Val Thr Asp Ala Glu Ile Ala Glu Val Leu Ala Arg Trp Thr
530 535 540

Gly Ile Pro Val Ser Arg Met Met Glu Ser Glu Arg Glu Lys Leu Leu
545 550 555 560

Arg Met Glu Gln Glu Leu His His Arg Val Ile Gly Gln Asn Glu Ala
565 570 575

Val Asp Ala Val Ser Asn Ala Ile Arg Arg Ser Arg Ala Gly Leu Ala
580 585 590

Asp Pro Asn Arg Pro Ile Gly Ser Phe Leu Phe Leu Gly Pro Thr Gly
595 600 605

Val Gly Lys Thr Glu Leu Cys Lys Ala Leu Ala Asn Phe Met Phe Asp
610 615 620

Ser Asp Glu Ala Met Val Arg Ile Asp Met Ser Glu Phe Met Glu Lys
625 630 635 640

His Ser Val Ser Arg Leu Val Gly Ala Pro Pro Gly Tyr Val Gly Tyr
645 650 655

Glu Glu Gly Gly Tyr Leu Thr Glu Ala Val Arg Arg Arg Pro Tyr Ser
660 665 670

Val Ile Leu Leu Asp Glu Val Glu Lys Ala His Pro Asp Val Phe Asn
675 680 685

Ile Leu Leu Gln Val Leu Asp Asp Gly Arg Leu Thr Asp Gly Gln Gly
690 695 700

Arg Thr Val Asp Phe Arg Asn Thr Val Val Ile Met Thr Ser Asn Leu
705 710 715 720

Gly Ser Asp Leu Ile Gln Glu Arg Phe Gly Glu Leu Asp Tyr Ala His
725 730 735

Met Lys Glu Leu Val Leu Gly Val Val Ser His Asn Phe Arg Pro Glu
740 745 750

Phe Ile Asn Arg Ile Asp Glu Val Val Val Phe His Pro Leu Gly Glu
755 760 765

Gln His Ile Ala Ser Ile Ala Gln Ile Gln Leu Lys Arg Leu Tyr Lys
770 775 780

Arg Leu Glu Glu Arg Gly Tyr Glu Ile His Ile Ser Asp Glu Ala Leu
785 790 795 800

Lys Leu Leu Ser Glu Asn Gly Tyr Asp Pro Val Tyr Gly Ala Arg Pro
805 810 815

Leu Lys Arg Ala Ile Gln Gln Gln Ile Glu Asn Pro Leu Ala Gln Gln
820 825 830

Ile Leu Ser Gly Glu Leu Val Pro Gly Lys Val Ile Arg Leu Glu Val
835 840 845

Asn Glu Asp Arg Ile Val Ala Val Gln
850 855

(2) INFORMATION FOR SEQ ID NO: 48:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1239 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p13c3)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1236



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

ATG ATG AAA GAA AAA GTG ATT TTT CTC GTT GAC ATG CAA TCG TTT TAT 48
Met Met Lys Glu Lys Val Ile Phe Leu Val Asp Met Gln Ser Phe Tyr
1 5 10 15

GCA TCT GTA GAG AAA GCG GAA AAT CCA CAT TTG AAA AAT AGG CCC GTC 96
Ala Ser Val Glu Lys Ala Glu Asn Pro His Leu Lys Asn Arg Pro Val
20 25 30

ATT GTT TCG GGT GAC CCT GAA AAA AGG GGC GGA GTC GTA TTG GCT GCC 144
Ile Val Ser Gly Asp Pro Glu Lys Arg Gly Gly Val Val Leu Ala Ala
35 40 45

TGC CCG CTG GCG AAA CAA AAG GGT GTG GTG AAT GCT TCA CGG CTG TGG 192
Cys Pro Leu Ala Lys Gln Lys Gly Val Val Asn Ala Ser Arg Leu Trp
50 55 60

GAG GCG CAG GAA AAG TGT CCT GAG GCT GTT GTG CTC CGG CCG CGT ATG 240
Glu Ala Gln Glu Lys Cys Pro Glu Ala Val Val Leu Arg Pro Arg Met
65 70 75 80

CAG CGG TAT ATT GAT GTA TCA CTG CAA ATT ACG GCC ATT CTC GAG GAG 288
Gln Arg Tyr Ile Asp Val Ser Leu Gln Ile Thr Ala Ile Leu Glu Glu
85 90 95

TAT ACA GAC CTT GTG GAG CCG TAT TCC ATC GAT GAA CAG TTC ATG GAC 336
Tyr Thr Asp Leu Val Glu Pro Tyr Ser Ile Asp Glu Gln Phe Met Asp
100 105 110

ATT ACA GGC AGC CAG AAG CTG TTT GGG ACG CCG ATG GAG ATC GCG AAA 384
Ile Thr Gly Ser Gln Lys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys
115 120 125

AGC ATT CAG GGC AGA ATC ATG CGG GAG ATC GGC GTT TAT GCA CGG GTC 432
Ser Ile Gln Gly Arg Ile Met Arg Glu Ile Gly Val Tyr Ala Arg Val
130 135 140

GGA ATC GGC CCT AAC AAA GCG CTG GCC AAA ATT GCG TGT GAC AAT TTT 480
Gly Ile Gly Pro Asn Lys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe
145 150 155 160

GCC AAA AAG AAT AAG AAC GGT ATT TTT ACC TTA ACG AAA GAA AAT ATG 528
Ala Lys Lys Asn Lys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met
165 170 175

AAA ACC GAA ATG TGG CCG CTC CCG GTG GGC AGC ATG TTT GGC GTC GGG 576
Lys Thr Glu Met Trp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly
180 185 190

AGC CGC ATG AAG CAT CAT TTA AAT CGA ATG GGC ATC AGC ACG ATC GGC 624
Ser Arg Met Lys His His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly
195 200 205

GGG CTC GCG GCT TTT CCG CTC GAT CTT TTA AAA AAG AAA TGG GGC ATT 672
Gly Leu Ala Ala Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile
210 215 220

AAC GGC CAC GTG CTG TGG ATG ACG GCA AAC GGA ATC GAC TAT TCC CCT 720
Asn Gly His Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro
225 230 235 240

GTG TCA ACT TCG TCT CTG GAC GGG CAA AAG GCG ATA GGT CAT GGA ATG 768
Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile Gly His Gly Met
245 250 255

ACT CTC CCG AGA GAC TAC GAA CAC TTT GAC AAA GAA ATC AAG GTC GTA 816
Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys Val Val
260 265 270

TTG CTT GAG CTG AGT GAA GAG GTG TGC AGG CGA AGC CGA AAC GCC GGG 864
Leu Leu Glu Leu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn Ala Gly
275 280 285

GTC ATG GGG CAG ACA GTG TCA GTG AGC TGC CGG GGT GCT GAT TTT GAT 912
Val Met Gly Gln Thr Val Ser Val Ser Cys Arg Gly Ala Asp Phe Asp
290 295 300

TGG CCG ACG GGC TTC AAC CGG CAA GTG AAG CTG GCA GAG CCG ACT AAT 960
Trp Pro Thr Gly Phe Asn Arg Gln Val Lys Leu Ala Glu Pro Thr Asn
305 310 315 320

TCT ACG CAG GAT GTA TAT GAG GCT GTA CGA CGG CTG TTT CTT ACA TTT 1008
Ser Thr Gln Asp Val Tyr Glu Ala Val Arg Arg Leu Phe Leu Thr Phe
325 330 335

TGG GAC GGG AAA CCC GTC CGC CGC CTC GGT GTC AAT CTG TCT CAG CTC 1056
Trp Asp Gly Lys Pro Val Arg Arg Leu Gly Val Asn Leu Ser Gln Leu
340 345 350

TCA TCT GAT GAC ATA TGG CAG CTC AAT TTA TTT CAG GAT TAT GCA AAG 1104
Ser Ser Asp Asp Ile Trp Gln Leu Asn Leu Phe Gln Asp Tyr Ala Lys
355 360 365

AAA ATG AGC CTA GGC TAT GTG ATG GAT GGC ATT AAA AAT CGA TTC GGC 1152
Lys Met Ser Leu Gly Tyr Val Met Asp Gly Ile Lys Asn Arg Phe Gly
370 375 380



GAT ACA GCA ATC ATC AGG GCG GCG TCA CTG ACA GCG GCA GGC CAG GCA 1200
Asp Thr Ala Ile Ile Arg Ala Ala Ser Leu Thr Ala Ala Gly Gln Ala
385 390 395 400

TTT GAA CGT GCG GCT AAA ATA GGG GGG CAT TAT AAA TGA 1239
Phe Glu Arg Ala Ala Lys Ile Gly Gly His Tyr Lys
405 410

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Met Met Lys Glu Lys Val Ile Phe Leu Val Asp Met Gln Ser Phe Tyr
1 5 10 15

Ala Ser Val Glu Lys Ala Glu Asn Pro His Leu Lys Asn Arg Pro Val
20 25 30

Ile Val Ser Gly Asp Pro Glu Lys Arg Gly Gly Val Val Leu Ala Ala
35 40 45

Cys Pro Leu Ala Lys Gln Lys Gly Val Val Asn Ala Ser Arg Leu Trp
50 55 60

Glu Ala Gln Glu Lys Cys Pro Glu Ala Val Val Leu Arg Pro Arg Met
65 70 75 80

Gln Arg Tyr Ile Asp Val Ser Leu Gln Ile Thr Ala Ile Leu Glu Glu
85 90 95

Tyr Thr Asp Leu Val Glu Pro Tyr Ser Ile Asp Glu Gln Phe Met Asp
100 105 110

Ile Thr Gly Ser Gln Lys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys
115 120 125

Ser Ile Gln Gly Arg Ile Met Arg Glu Ile Gly Val Tyr Ala Arg Val
130 135 140

Gly Ile Gly Pro Asn Lys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe
145 150 155 160

Ala Lys Lys Asn Lys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met
165 170 175

Lys Thr Glu Met Trp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly
180 185 190

Ser Arg Met Lys His His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly
195 200 205

Gly Leu Ala Ala Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile
210 215 220

Asn Gly His Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro
225 230 235 240

Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile Gly His Gly Met
245 250 255

Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys Val Val
260 265 270

Leu Leu Glu Leu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn Ala Gly 
275 280 285 

Val Met Gly Gin Thr Val Ser Val Ser Cys Arg Gly Ala Asp Phe Asp 
290 295 300 

Trp Pro Thr Gly Phe Asn Arg Gin Val Lys Leu Ala Glu Pro Thr Asn 
305 310 315 320 

Ser Thr Gin Asp Val Tyr Glu Ala Val Arg Arg Leu Phe Leu Thr Phe 
325 330 335 

trp Asp Gly Lys Pro Val Arg Arg Leu Gly Val Asn Leu Ser Gin Leu 
340 345 350 

Ser Ser Asp Asp lie Trp Gin Leu Asn Leu Phe Gin Asp Tyr Ala Lys 
355 360 365 

Lys Met Ser Leu Gly Tyr Val Met Asp Gly He Lys Asn Arg Phe Gly 
370 375 380 

Asp Thr Ala He He Arg Ala Ala Ser Leu Thr Ala Ala Gly Gin Ala 
385 390 395 400 

Phe Glu Arg Ala Ala Lys He Gly Gly His Tyr Lys 
4 05 410 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1452 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p4b3)"
Unknown = Xaa

Unknown for codon no. 296 = TAA (stop codon)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1449



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

ATG AGT AAA ATT ATT GGA TCA GAC AGA GTC AAA AGA GGT ATG GCT GAA 48
Met Ser Lys Ile Ile Gly Ser Asp Arg Val Lys Arg Gly Met Ala Glu
1 5 10 15

ATG CAA AAA GGC GGC GTT ATT ATG GAT GTC GTT AAT GCT GAG CAA GCA 96
Met Gln Lys Gly Gly Val Ile Met Asp Val Val Asn Ala Glu Gln Ala
20 25 30

AGA ATT GCA GAA GAA GCT GGC GCG GTA GCA GTT ATG GCA TTA GAA CGA 144
Arg Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg
35 40 45

GTA CCT TCT GAT ATT AGA GCT GCT GGT GGT GTT GCA CGT ATG GCA AAC 192
Val Pro Ser Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asn
50 55 60

CCT AAA ATT GTA GAA GAA GTA ATG AAT GCT GTT TCT ATT CCA GTC ATG 240
Pro Lys Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met
65 70 75 80

GCT AAA GCA CGT ATT GGT CAT ATC ACT GAA GCA AGA GTA TTA GAG GCG 288
Ala Lys Ala Arg Ile Gly His Ile Thr Glu Ala Arg Val Leu Glu Ala
85 90 95

ATG GGT GTT GAC TAT ATT GAT GAA TCA GAA GTG TTA ACA CCA GCA GAT 336
Met Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp
100 105 110

GAG GAA TAT CAC TTA AGA AAA GAT CAA TTT ACA GTA CCA TTT GTA TGT 384
Glu Glu Tyr His Leu Arg Lys Asp Gln Phe Thr Val Pro Phe Val Cys
115 120 125

GGA TGT CGT AAT TTA GGT GAA GmT GCG CGT AGA ATT GGT GAA GGT GCT 432
Gly Cys Arg Asn Leu Gly Glu Xaa Ala Arg Arg Ile Gly Glu Gly Ala
130 135 140

GCT ATG TTA CGT ACT AAA GGT GAA CCA GGT ACA GGT AAT ATT GTT GAA 480
Ala Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu
145 150 155 160

GCT GTA AGA CAT ATG AGA CAA GTT AAT TCA GAA GTT AGT CGA TTG ACT 528
Ala Val Arg His Met Arg Gln Val Asn Ser Glu Val Ser Arg Leu Thr
165 170 175

GTA ATG AAT GAT GAT GAG ATT ATG ACT TTT GCG AAA GAT ATC GGT GCG 576
Val Met Asn Asp Asp Glu Ile Met Thr Phe Ala Lys Asp Ile Gly Ala
180 185 190

CCT TAT GAA ATT TTA AAA CAA ATT AAA GAC AAT GGT CGT TTA CCG GTA 624
Pro Tyr Glu Ile Leu Lys Gln Ile Lys Asp Asn Gly Arg Leu Pro Val
195 200 205

GTT AAC TTT GCA GCT GGT GGC GTT GCG ACT CCT CAA GAT GCT GCT TTA 672
Val Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Gln Asp Ala Ala Leu
210 215 220

ATG ATG GAA TTA GGT GCT GAC GGT GTA TTC GTT GGA TCA GGT ATT TTT 720
Met Met Glu Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe
225 230 235 240

AAA TCA GAA GAT CCA GAA AAA TTT GCT AAA GCA ATT GTT CAA GCA ACA 768
Lys Ser Glu Asp Pro Glu Lys Phe Ala Lys Ala Ile Val Gln Ala Thr
245 250 255

ACA CAT TAC CAA GAC TAT GAA CTA ATT GGA AGA TTA GCA AGT GAA CTT 816
Thr His Tyr Gln Asp Tyr Glu Leu Ile Gly Arg Leu Ala Ser Glu Leu
260 265 270

GGC ACT GCT ATG AAA GGT TTA GAT ATC AAT CAA TTA TCA TTA GAA GAA 864
Gly Thr Ala Met Lys Gly Leu Asp Ile Asn Gln Leu Ser Leu Glu Glu
275 280 285

CGT ATG CAA GAG CGT GGT TGG TAA GAT ATG AAA ATA GGT GTA TTA GCA 912
Arg Met Gln Glu Arg Gly Trp Xaa Asp Met Lys Ile Gly Val Leu Ala
290 295 300

TTA CAA GGT GCA GTA CGT GAA CAT ATT AGA CAT ATT GAA TTA AGT GGT 960
Leu Gln Gly Ala Val Arg Glu His Ile Arg His Ile Glu Leu Ser Gly
305 310 315 320

CAT GAA GGT ATT GCA GTT AAA AAA GTT GAA CAA TTA GAA GAA ATC GAG 1008
His Glu Gly Ile Ala Val Lys Lys Val Glu Gln Leu Glu Glu Ile Glu
325 330 335

GGC TTA ATA TTA CCT GGT GGC GAG TCT ACA ACG TTA CGT CGA TTA ATG 1056
Gly Leu Ile Leu Pro Gly Gly Glu Ser Thr Thr Leu Arg Arg Leu Met
340 345 350

AAT TTA TAT GGA TTT AAA GAG GCT TTA CAA AAT TCA ACT TTA CCT ATG 1104
Asn Leu Tyr Gly Phe Lys Glu Ala Leu Gln Asn Ser Thr Leu Pro Met
355 360 365

TTT GGT ACA TGC GCA GGA TTA ATA GTT CTA GCG CAA GAT ATA GTT GGT 1152
Phe Gly Thr Cys Ala Gly Leu Ile Val Leu Ala Gln Asp Ile Val Gly
370 375 380

GAA GAA GGA TAC CTT AAC AAG TTG AAT ATT ACT GTA CAA CGA AAC TCA 1200
Glu Glu Gly Tyr Leu Asn Lys Leu Asn Ile Thr Val Gln Arg Asn Ser
385 390 395 400

TTC GGT AGA CAA GTT GAC AGC TTT GAA ACA GAA TTA GAT ATT AAA GGT 1248
Phe Gly Arg Gln Val Asp Ser Phe Glu Thr Glu Leu Asp Ile Lys Gly
405 410 415

ATC GCT ACA GAT ATT GAA GGT GTC TTT ATA AGA GCC CCA CAT ATT GAA 1296
Ile Ala Thr Asp Ile Glu Gly Val Phe Ile Arg Ala Pro His Ile Glu
420 425 430

AAA GTA GGT CAA GGC GTA GAT ATC CTA TGT AAG GTT AAT GAG AAA ATT 1344
Lys Val Gly Gln Gly Val Asp Ile Leu Cys Lys Val Asn Glu Lys Ile
435 440 445

GTA GCT GTT CAG CAA GGT AAA TAT TTA GGC GTA TCA TTC CAT CCT GAA 1392
Val Ala Val Gln Gln Gly Lys Tyr Leu Gly Val Ser Phe His Pro Glu
450 455 460

TTA ACA GAT GAC TAT AGA GTA ACT GAT TAC TTT ATT AAT CAT ATT GTA 1440
Leu Thr Asp Asp Tyr Arg Val Thr Asp Tyr Phe Ile Asn His Ile Val
465 470 475 480

AAA AAA GCA TAG 1452
Lys Lys Ala



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Ser Lys Ile Ile Gly Ser Asp Arg Val Lys Arg Gly Met Ala Glu
1 5 10 15

Met Gln Lys Gly Gly Val Ile Met Asp Val Val Asn Ala Glu Gln Ala
20 25 30

Arg Ile Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg
35 40 45

Val Pro Ser Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asn
50 55 60

Pro Lys Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met
65 70 75 80

Ala Lys Ala Arg Ile Gly His Ile Thr Glu Ala Arg Val Leu Glu Ala
85 90 95

Met Gly Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp
100 105 110

Glu Glu Tyr His Leu Arg Lys Asp Gln Phe Thr Val Pro Phe Val Cys
115 120 125

Gly Cys Arg Asn Leu Gly Glu Xaa Ala Arg Arg Ile Gly Glu Gly Ala
130 135 140

Ala Met Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu
145 150 155 160

Ala Val Arg His Met Arg Gln Val Asn Ser Glu Val Ser Arg Leu Thr
165 170 175

Val Met Asn Asp Asp Glu Ile Met Thr Phe Ala Lys Asp Ile Gly Ala
180 185 190

Pro Tyr Glu Ile Leu Lys Gln Ile Lys Asp Asn Gly Arg Leu Pro Val
195 200 205

Val Asn Phe Ala Ala Gly Gly Val Ala Thr Pro Gln Asp Ala Ala Leu
210 215 220

Met Met Glu Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe
225 230 235 240

Lys Ser Glu Asp Pro Glu Lys Phe Ala Lys Ala Ile Val Gln Ala Thr
245 250 255

Thr His Tyr Gln Asp Tyr Glu Leu Ile Gly Arg Leu Ala Ser Glu Leu
260 265 270

Gly Thr Ala Met Lys Gly Leu Asp Ile Asn Gln Leu Ser Leu Glu Glu
275 280 285

Arg Met Gln Glu Arg Gly Trp Xaa Asp Met Lys Ile Gly Val Leu Ala
290 295 300

Leu Gln Gly Ala Val Arg Glu His Ile Arg His Ile Glu Leu Ser Gly
305 310 315 320

His Glu Gly Ile Ala Val Lys Lys Val Glu Gln Leu Glu Glu Ile Glu
325 330 335

Gly Leu Ile Leu Pro Gly Gly Glu Ser Thr Thr Leu Arg Arg Leu Met
340 345 350

Asn Leu Tyr Gly Phe Lys Glu Ala Leu Gln Asn Ser Thr Leu Pro Met
355 360 365

Phe Gly Thr Cys Ala Gly Leu Ile Val Leu Ala Gln Asp Ile Val Gly
370 375 380

Glu Glu Gly Tyr Leu Asn Lys Leu Asn Ile Thr Val Gln Arg Asn Ser
385 390 395 400

Phe Gly Arg Gln Val Asp Ser Phe Glu Thr Glu Leu Asp Ile Lys Gly
405 410 415

Ile Ala Thr Asp Ile Glu Gly Val Phe Ile Arg Ala Pro His Ile Glu
420 425 430

Lys Val Gly Gln Gly Val Asp Ile Leu Cys Lys Val Asn Glu Lys Ile
435 440 445

Val Ala Val Gln Gln Gly Lys Tyr Leu Gly Val Ser Phe His Pro Glu
450 455 460

Leu Thr Asp Asp Tyr Arg Val Thr Asp Tyr Phe Ile Asn His Ile Val
465 470 475 480

Lys Lys Ala



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p4c.63)"
Unknown = Xaa

Unknown for codon no. 318 = TAG (stop codon)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..975



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

AAC AAA GCC TTC CAA TTA TCT GCG TCG GTA GAA CAA GTA TTA GCA ACT 48
Asn Lys Ala Phe Gln Leu Ser Ala Ser Val Glu Gln Val Leu Ala Thr
1 5 10 15

TTA TCA CCT ACG CTA AAC AGT CCT TAC GAT TTA TAC GGC ACG ACA AAA 96
Leu Ser Pro Thr Leu Asn Ser Pro Tyr Asp Leu Tyr Gly Thr Thr Lys
20 25 30

ATG CTA GAT ATT ACA TTC GAT TCA TTT GAA CAT GAT GGT ACA ACG TAC 144
Met Leu Asp Ile Thr Phe Asp Ser Phe Glu His Asp Gly Thr Thr Tyr
35 40 45

CCT GTC GAC TAT GCT ACG TTT GAA AAT GAT TAT GAA GAT AAT AAA GAT 192
Pro Val Asp Tyr Ala Thr Phe Glu Asn Asp Tyr Glu Asp Asn Lys Asp
50 55 60

CCT GAG TTT AGA CGT AAA AGT TTC AAA TCG TTT AGC GAT GGG ATT CGA 240
Pro Glu Phe Arg Arg Lys Ser Phe Lys Ser Phe Ser Asp Gly Ile Arg
65 70 75 80



AAA TAT CAG CAT ACT ACC GCG GCT ACA TAT AAT ATG CAA GTA CAA CAA 288
Lys Tyr Gln His Thr Thr Ala Ala Thr Tyr Asn Met Gln Val Gln Gln
85 90 95

GAA AAA ATT GAA GCT GAT TTA CGT GGA TTT [4 codons illegible] GAT TAT 336
Glu Lys Ile Glu Ala Asp Leu Arg Gly Phe Glu Ser Val Ile Asp Tyr
100 105 110

TTA TTA CAT AGT CAA GAA GTA ACG CGT GAT ATG TTT [4 codons illegible] 384
Leu Leu His Ser Gln Glu Val Thr Arg Asp Met Phe Asp Arg Gln Ile
115 120 125

GAT ATG ATT ATG CGT GAC TTG GCA CCA GTT ATG [illegible] AAA TAT [illegible] AAA 432
Asp Met Ile Met Arg Asp Leu Ala Pro Val Met Gln Lys Tyr Ala Lys
130 135 140

CTT TTA CAA CGT ATT CAC GGA TTA GAT AAC ATG CGT TTT GAA GAC TTG 480
Leu Leu Gln Arg Ile His Gly Leu Asp Asn Met Arg Phe Glu Asp Leu
145 150 155 160

AAG ATT TCT GTA GAC CCT GAT TAT GAA CCA GAG ATT TCA ATT GAA GAC 528
Lys Ile Ser Val Asp Pro Asp Tyr Glu Pro Glu Ile Ser Ile Glu Asp
165 170 175

TCA AAA AAT TAT ATT TTC GGT GCG TTA AGT GTT TTA GGT GAT GAC TAT 576
Ser Lys Asn Tyr Ile Phe Gly Ala Leu Ser Val Leu Gly Asp Asp Tyr
180 185 190

ACA AAC ATG TTA CGT GAA GCA TAC GAT CAG CGA TGG ATT GAT TTT GCA 624
Thr Asn Met Leu Arg Glu Ala Tyr Asp Gln Arg Trp Ile Asp Phe Ala
195 200 205

CAA AAT AAA GGT AAA GAT ACA GGC GCA TTT TGT [4 codons illegible] TTT 672
Gln Asn Lys Gly Lys Asp Thr Gly Ala Phe Cys Ala Ser Pro Tyr Phe
210 215 220

ACA CAT TCA TAT GTG TTT ATT TCT TGG ACT GGT AAA ATG GCT GAA GCA 720
Thr His Ser Tyr Val Phe Ile Ser Trp Thr Gly Lys Met Ala Glu Ala
225 230 235 240

TTT GTC TTA GCA CAT GAA TTA GGT CAT GCA GGT CAT TTT ACA TTA GCT 768
Phe Val Leu Ala His Glu Leu Gly His Ala Gly His Phe Thr Leu Ala
245 250 255

CAA AAA CAT CAA CCA TAT CTT GAA TCA GAA GCA TCA ATG TAC TTT GTT 816
Gln Lys His Gln Pro Tyr Leu Glu Ser Glu Ala Ser Met Tyr Phe Val
260 265 270

GAA GCC CCT TCT ACA ATG AAT GAA ATG TTG ATG [illegible] AAT TAT TTA TTT 864
Glu Ala Pro Ser Thr Met Asn Glu Met Leu Met Ala Asn Tyr Leu Phe
275 280 285

AAC ACA AGT GAT AAT CCA AGA TTT AAG CGT TGG GTT ATT GGC TCA ATT 912
Asn Thr Ser Asp Asn Pro Arg Phe Lys Arg Trp Val Ile Gly Ser Ile
290 295 300

TTA TCT AGA ACA TAT TAT CAT AAT ATG GTA CCC mTT TAT TAG AAG CnG 960
Leu Ser Arg Thr Tyr Tyr His Asn Met Val Pro Xaa Tyr Xaa Lys Xaa
305 310 315 320

CTT ATC CAC GGG GAG TG 977
Leu Ile His Gly Glu
325

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

Asn Lys Ala Phe Gln Leu Ser Ala Ser Val Glu Gln Val Leu Ala Thr
1 5 10 15

Leu Ser Pro Thr Leu Asn Ser Pro Tyr Asp Leu Tyr Gly Thr Thr Lys
20 25 30

Met Leu Asp Ile Thr Phe Asp Ser Phe Glu His Asp Gly Thr Thr Tyr
35 40 45

Pro Val Asp Tyr Ala Thr Phe Glu Asn Asp Tyr Glu Asp Asn Lys Asp
50 55 60

Pro Glu Phe Arg Arg Lys Ser Phe Lys Ser Phe Ser Asp Gly Ile Arg
65 70 75 80

Lys Tyr Gln His Thr Thr Ala Ala Thr Tyr Asn Met Gln Val Gln Gln
85 90 95

Glu Lys Ile Glu Ala Asp Leu Arg Gly Phe Glu Ser Val Ile Asp Tyr
100 105 110

Leu Leu His Ser Gln Glu Val Thr Arg Asp Met Phe Asp Arg Gln Ile
115 120 125

Asp Met Ile Met Arg Asp Leu Ala Pro Val Met Gln Lys Tyr Ala Lys
130 135 140

Leu Leu Gln Arg Ile His Gly Leu Asp Asn Met Arg Phe Glu Asp Leu
145 150 155 160

Lys Ile Ser Val Asp Pro Asp Tyr Glu Pro Glu Ile Ser Ile Glu Asp
165 170 175

Ser Lys Asn Tyr Ile Phe Gly Ala Leu Ser Val Leu Gly Asp Asp Tyr
180 185 190

Thr Asn Met Leu Arg Glu Ala Tyr Asp Gln Arg Trp Ile Asp Phe Ala
195 200 205

Gln Asn Lys Gly Lys Asp Thr Gly Ala Phe Cys Ala Ser Pro Tyr Phe
210 215 220

Thr His Ser Tyr Val Phe Ile Ser Trp Thr Gly Lys Met Ala Glu Ala
225 230 235 240

Phe Val Leu Ala His Glu Leu Gly His Ala Gly His Phe Thr Leu Ala
245 250 255

Gln Lys His Gln Pro Tyr Leu Glu Ser Glu Ala Ser Met Tyr Phe Val
260 265 270

Glu Ala Pro Ser Thr Met Asn Glu Met Leu Met Ala Asn Tyr Leu Phe
275 280 285

Asn Thr Ser Asp Asn Pro Arg Phe Lys Arg Trp Val Ile Gly Ser Ile
290 295 300



WO 99/01473 



PCT/GB98/01974 




Leu Ser Arg Thr Tyr Tyr His Asn Met Val Pro Xaa Tyr Xaa Lys Xaa 
305 310 315 320 

Leu Ile His Gly Glu
325 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p5c3)"
Unknown = Xaa

Unknown for codon no. 110 = TAA (stop codon)
Unknown for codon no. 113 = TAA (stop codon)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..465

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

ATG TAT CAA CTA CAA TTT ATA AAT TTA GTT TAC GAC ACA ACC AAA CTC 48
Met Tyr Gln Leu Gln Phe Ile Asn Leu Val Tyr Asp Thr Thr Lys Leu
1 5 10 15

ACA CAT CTA GAA CAA ACC AAT ATC AAT TTA TTC ATT GGT AAT TGG AGT 96
Thr His Leu Glu Gln Thr Asn Ile Asn Leu Phe Ile Gly Asn Trp Ser
20 25 30

AAT CAT CAA TTA CAA AAA TCA ATT TGT ATA CGT CAT GGC GAT GAT ACA 144
Asn His Gln Leu Gln Lys Ser Ile Cys Ile Arg His Gly Asp Asp Thr
35 40 45

AGT CAC AAT CAA TAT CAT ATT CTT TTT ATA GAT ACG GCA CAT CAA CGC 192
Ser His Asn Gln Tyr His Ile Leu Phe Ile Asp Thr Ala His Gln Arg
50 55 60

ATT AAA TTT TCA TCT ATT GAT AAT GAA GAA ATC ATT TAT ATT CTT GAT 240
Ile Lys Phe Ser Ser Ile Asp Asn Glu Glu Ile Ile Tyr Ile Leu Asp
65 70 75 80

TAT GAT GAT ACA CAG CAT ATC CTC ATG CAA ACG TCA TCC AAA CAA GGT 288
Tyr Asp Asp Thr Gln His Ile Leu Met Gln Thr Ser Ser Lys Gln Gly
85 90 95

ATT GGC ACT TCG CGA CCA ATC GTT TAT GAG CGC TTA GTA TAA CTA ATT 336
Ile Gly Thr Ser Arg Pro Ile Val Tyr Glu Arg Leu Val Xaa Leu Ile
100 105 110

TAA ATG ATT TCA CTT CAT AAA GCG GGT TGG CGA GAA TTC AAT TTC TCA 384
Xaa Met Ile Ser Leu His Lys Ala Gly Trp Arg Glu Phe Asn Phe Ser
115 120 125

CCA GCT CGT TTT TTC ATT GTA ATA ATA ATC TTT AAC ATT TAT TCT TTC 432
Pro Ala Arg Phe Phe Ile Val Ile Ile Ile Phe Asn Ile Tyr Ser Phe
130 135 140

TCT ATT AAT TTT TCT CAA ACT ATC TTA TCT TTA TGATAA 471
Ser Ile Asn Phe Ser Gln Thr Ile Leu Ser Leu
145 150 155

(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Tyr Gln Leu Gln Phe Ile Asn Leu Val Tyr Asp Thr Thr Lys Leu
1 5 10 15

Thr His Leu Glu Gln Thr Asn Ile Asn Leu Phe Ile Gly Asn Trp Ser
20 25 30

Asn His Gln Leu Gln Lys Ser Ile Cys Ile Arg His Gly Asp Asp Thr
35 40 45

Ser His Asn Gln Tyr His Ile Leu Phe Ile Asp Thr Ala His Gln Arg
50 55 60

Ile Lys Phe Ser Ser Ile Asp Asn Glu Glu Ile Ile Tyr Ile Leu Asp
65 70 75 80

Tyr Asp Asp Thr Gln His Ile Leu Met Gln Thr Ser Ser Lys Gln Gly
85 90 95

Ile Gly Thr Ser Arg Pro Ile Val Tyr Glu Arg Leu Val Xaa Leu Ile
100 105 110

Xaa Met Ile Ser Leu His Lys Ala Gly Trp Arg Glu Phe Asn Phe Ser
115 120 125

Pro Ala Arg Phe Phe Ile Val Ile Ile Ile Phe Asn Ile Tyr Ser Phe
130 135 140

Ser Ile Asn Phe Ser Gln Thr Ile Leu Ser Leu
145 150 155



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p8d26)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 14..343



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:

ATTTNTAGAA TAA TAT TTC CAT ATT GGA AAA AGG GAA GAA TTC GTT ATG 49
Tyr Phe His Ile Gly Lys Arg Glu Glu Phe Val Met
1 5 10

AAA AGC TAT AAG TGT AAA GGT TCA TTC TTA ATA GAT AGT ATG GCT GGA 97
Lys Ser Tyr Lys Cys Lys Gly Ser Phe Leu Ile Asp Ser Met Ala Gly
15 20 25

TTT TTG CTA ATT GGA TTG ATN ACA TTA CTA TTG ATA CCA ATG ATG AAT 145
Phe Leu Leu Ile Gly Leu Xaa Thr Leu Leu Leu Ile Pro Met Met Asn
30 35 40

CAA ATG CAA GCG AGT ATA AAC CAT AAA CTA CAA ACA ATT GAT GCT TCT 193
Gln Met Gln Ala Ser Ile Asn His Lys Leu Gln Thr Ile Asp Ala Ser
45 50 55 60

AAA GTA ATT TTG ACG ACT GTA TCT AAA ATT AAT AAA GAA GAA CTT AAG 241
Lys Val Ile Leu Thr Thr Val Ser Lys Ile Asn Lys Glu Glu Leu Lys
65 70 75

AAG GGG GTA ACT ATA GGG AAG TAT GAT ATT AAG CAA AGT GAC CAA CAA 289
Lys Gly Val Thr Ile Gly Lys Tyr Asp Ile Lys Gln Ser Asp Gln Gln
80 85 90

ATT TGT GCT ATT TCA ANA AAT ACC ANT TCT TAT CAA AAG ACA TGT ATA 337
Ile Cys Ala Ile Ser Xaa Asn Thr Xaa Ser Tyr Gln Lys Thr Cys Ile
95 100 105

CAG TAT AAATGTCAAA GCTTTTTCGC TCATTGAAAT GTTAGTAGCG ATGATGGTTA 393
Gln Tyr
110

TAAGTATAAC TTTACTAATT GTTCCAGACT TAATTAGACT TAGTAAAACT TTTCTAATTG 453

AAAGTAGGGA TTTAACAACT GTAGATTTCG AATTTTTCTC AAGAGATATT CTAGATGATT 513

TTAAAGGAGT AGATAGAAAC GATATTGAAA TTAGGCAACA CCGTATCATT GTACATAAAG 573

GTGAATAAAA ATGGC 588

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

Tyr Phe His Ile Gly Lys Arg Glu Glu Phe Val Met Lys Ser Tyr Lys
1 5 10 15

Cys Lys Gly Ser Phe Leu Ile Asp Ser Met Ala Gly Phe Leu Leu Ile
20 25 30

Gly Leu Xaa Thr Leu Leu Leu Ile Pro Met Met Asn Gln Met Gln Ala
35 40 45

Ser Ile Asn His Lys Leu Gln Thr Ile Asp Ala Ser Lys Val Ile Leu
50 55 60

Thr Thr Val Ser Lys Ile Asn Lys Glu Glu Leu Lys Lys Gly Val Thr
65 70 75 80

Ile Gly Lys Tyr Asp Ile Lys Gln Ser Asp Gln Gln Ile Cys Ala Ile
85 90 95

Ser Xaa Asn Thr Xaa Ser Tyr Gln Lys Thr Cys Ile Gln Tyr
100 105 110

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p8d26)"
Unknown = Xaa

Unknown for codon no. 88 = TAA (stop codon)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 318..587

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

ATTTATAGAA TAATATTTCC ATATTGGAAA AAGGGAAGAA TTCGTTATGA AAAGCTATAA 60

GTGTAAAGGT TCATTCTTAA TAGATAGTAT GGCTGGATTT TTGCTAATTG GATTGATAAC 120

ATTACTATTG ATACCAATGA TGAATCAAAT GCAAGCGAGT ATAAACCATA AACTACAAAC 180

AATTGATGCT TCTAAAGTAA TTTTGACGAC TGTATCTAAA ATTAATAAAG AAGAACTTAA 240

GAAGGGGGTA ACTATAGGGA AGTATGATAT TAAGCAAAGT GACCAACAAA TTTGTGCTAT 300

TTCAAAAAAT ACCAATT CTT ATC AAA AGA CAT GTA TAC AGT ATA AAT GTC 350
Leu Ile Lys Arg His Val Tyr Ser Ile Asn Val
1 5 10

AAA GCT TTT TCG CTC ATT GAA ATG TTA GTA GCG ATG ATG GTT ATA AGT 398
Lys Ala Phe Ser Leu Ile Glu Met Leu Val Ala Met Met Val Ile Ser
15 20 25

ATA ACT TTA CTA ATT GTT CCA GAC TTA ATT AGA CTT AGT AAA ACT TTT 446
Ile Thr Leu Leu Ile Val Pro Asp Leu Ile Arg Leu Ser Lys Thr Phe
30 35 40

CTA ATT GAA AGT AGG GAT TTA ACA ACT GTA GAT TTC GAA TTT TTC TCA 494
Leu Ile Glu Ser Arg Asp Leu Thr Thr Val Asp Phe Glu Phe Phe Ser
45 50 55

AGA GAT ATT CTA GAT GAT TTT AAA GGA GTA GAT AGA AAC GAT ATT GAA 542
Arg Asp Ile Leu Asp Asp Phe Lys Gly Val Asp Arg Asn Asp Ile Glu
60 65 70 75

ATT AGG CAA CAC CGT ATC ATT GTA CAT AAA GGT GNN TAA AAA TGG 587
Ile Arg Gln His Arg Ile Ile Val His Lys Gly Xaa Xaa Lys Trp
80 85 90

C 588

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

Leu Ile Lys Arg His Val Tyr Ser Ile Asn Val Lys Ala Phe Ser Leu
1 5 10 15

Ile Glu Met Leu Val Ala Met Met Val Ile Ser Ile Thr Leu Leu Ile
20 25 30

Val Pro Asp Leu Ile Arg Leu Ser Lys Thr Phe Leu Ile Glu Ser Arg
35 40 45

Asp Leu Thr Thr Val Asp Phe Glu Phe Phe Ser Arg Asp Ile Leu Asp
50 55 60

Asp Phe Lys Gly Val Asp Arg Asn Asp Ile Glu Ile Arg Gln His Arg
65 70 75 80

Ile Ile Val His Lys Gly Xaa Xaa Lys Trp
85 90

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1338 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p9b65)"

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATG ATC CGG CTT GTG ACG ATG GGG AAA AGT TCT GAG GCC GGG GTG TCT 48
Met Ile Arg Leu Val Thr Met Gly Lys Ser Ser Glu Ala Gly Val Ser
1 5 10 15

TCT TTC CAG GCG TTG ACG ATG TCG TTG TCC GGC CGC ATC GGC GTC GGG 96
Ser Phe Gln Ala Leu Thr Met Ser Leu Ser Gly Arg Ile Gly Val Gly
20 25 30

AAC GTG GCC GGG ACG GCG ACG GGG ATT GCG TAC GGG GGG CCG GGC GCG 144
Asn Val Ala Gly Thr Ala Thr Gly Ile Ala Tyr Gly Gly Pro Gly Ala
35 40 45

GTC TTT TGG ATG TGG GTG ATC ACC TTT ATC GGA GCG GCG ACC GCG TAT 192
Val Phe Trp Met Trp Val Ile Thr Phe Ile Gly Ala Ala Thr Ala Tyr
50 55 60

GTC GAG TCG ACG TGG CGC AAA TTT ATA AAG AGG AAC AAG ACG GAC AAT 240
Val Glu Ser Thr Trp Arg Lys Phe Ile Lys Arg Asn Lys Thr Asp Asn
65 70 75 80

ACC GTG GCG GTC CGG CGT TCT ACA TTG AAA AAG GCC TTG GCT GGA AAT 288
Thr Val Ala Val Arg Arg Ser Thr Leu Lys Lys Ala Leu Ala Gly Asn
85 90 95

GGT TTG CGG TGT AGT CGC GCG GCG ATC ATT CTC TCG ATG GCG GTG CTG 336
Gly Leu Arg Cys Ser Arg Ala Ala Ile Ile Leu Ser Met Ala Val Leu
100 105 110

ATG CCG GGA ATT CAA GCA AAC TCG ATT GCC GAC AGC TTT TCG AAT GCG 384
Met Pro Gly Ile Gln Ala Asn Ser Ile Ala Asp Ser Phe Ser Asn Ala
115 120 125

TTT GGC ATT CCG AAA TTG GTG ACG GGA ATT TTC GTG ATT GCC GTT CTT 432
Phe Gly Ile Pro Lys Leu Val Thr Gly Ile Phe Val Ile Ala Val Leu
130 135 140

GGC TTT ACG ATT TTT GGC GGA GTG AAG CGG ATC GCG AAA ACG GCG GAA 480
Gly Phe Thr Ile Phe Gly Gly Val Lys Arg Ile Ala Lys Thr Ala Glu
145 150 155 160

ATT GTC GTG CCG TTT ATG GCA GTT GGC TAT TTG TTC GTC GCG ATT GCC 528
Ile Val Val Pro Phe Met Ala Val Gly Tyr Leu Phe Val Ala Ile Ala
165 170 175

ATT ATT GCG GCC AAT ATT GAA AAA GTC CCG GAT GTG TTT GGT TTG ATT 576
Ile Ile Ala Ala Asn Ile Glu Lys Val Pro Asp Val Phe Gly Leu Ile
180 185 190

TTC AAA AGC GCG TTT GGC GCT GAT CAA GTG TTT GGC GGC ATT CTT GGT 624
Phe Lys Ser Ala Phe Gly Ala Asp Gln Val Phe Gly Gly Ile Leu Gly
195 200 205

TCG GCG GTG ATG TGG GGG GTC AAA CGC GGC CTT TAT GCG AAT GAA GCG 672
Ser Ala Val Met Trp Gly Val Lys Arg Gly Leu Tyr Ala Asn Glu Ala
210 215 220

GGG CAA GGG ACG GGC GCC CAC CCG GCA GCG GCG GCG GAA GTG TCC CAC 720
Gly Gln Gly Thr Gly Ala His Pro Ala Ala Ala Ala Glu Val Ser His
225 230 235 240

CCG GCG AAG CAG GGG CTT GTG CAG GCA TTT TCG ATC TAT TTG GAC GTG 768
Pro Ala Lys Gln Gly Leu Val Gln Ala Phe Ser Ile Tyr Leu Asp Val
245 250 255

TTC TTG GTC GTG ACG GCG ACG GCG CTG ATG ATT TTG TTT ACG GGT CAA 816
Phe Leu Val Val Thr Ala Thr Ala Leu Met Ile Leu Phe Thr Gly Gln
260 265 270

TAC AAT GTG ATC AAT GAA AAA ACG GGA GAG ACG ATT GTC GAG CAT TTG 864
Tyr Asn Val Ile Asn Glu Lys Thr Gly Glu Thr Ile Val Glu His Leu
275 280 285

AAA GGG GTG GAA CCG GGC GCA GGG TAT ACG CAG GCG GCG GTG GAC ACG 912
Lys Gly Val Glu Pro Gly Ala Gly Tyr Thr Gln Ala Ala Val Asp Thr
290 295 300

CTC TTC CCG GGA TTC GGG TCG GCC TTT ATT GCG ATC GCT CTG TTC TTC 960
Leu Phe Pro Gly Phe Gly Ser Ala Phe Ile Ala Ile Ala Leu Phe Phe
305 310 315 320

TTC GCG TTT ACG ACG ATG TAC GCG TAT TAC TAT ATT GCC GAG ACG AAC 1008
Phe Ala Phe Thr Thr Met Tyr Ala Tyr Tyr Tyr Ile Ala Glu Thr Asn
325 330 335

CTC GCC TAT TTG GTG CGC AGT GAA AAG AGG GGA ACG GCC TTC TTT GCC 1056
Leu Ala Tyr Leu Val Arg Ser Glu Lys Arg Gly Thr Ala Phe Phe Ala
340 345 350

TTG AAG CTC GTC TTT TTG GCG GCC ACG TTC TAT GGA ACG GTC AAA ACG 1104
Leu Lys Leu Val Phe Leu Ala Ala Thr Phe Tyr Gly Thr Val Lys Thr
355 360 365

GCG ACG ACG GCG TGG GCG ATG GGC GAC ATC GGG CTT GGC ATC ATG GTG 1152
Ala Thr Thr Ala Trp Ala Met Gly Asp Ile Gly Leu Gly Ile Met Val
370 375 380

TGG CTC AAC TTG ATT GCG ATC TTG TTG TTG TTT AAA CCG GCC TAT ATG 1200
Trp Leu Asn Leu Ile Ala Ile Leu Leu Leu Phe Lys Pro Ala Tyr Met
385 390 395 400

GCC TTG AAA GAT TAT GAA GAA CAG CTG AAG CAA GCG AAA GAT CCG GAG 1248
Ala Leu Lys Asp Tyr Glu Glu Gln Leu Lys Gln Gly Lys Asp Pro Glu
405 410 415

TTC AAC GCG TCG AAA TAC GGA ATC AAG AAC GCG AAA TTC TGG GAA AAT 1296
Phe Asn Ala Ser Lys Tyr Gly Ile Lys Asn Ala Lys Phe Trp Glu Asn
420 425 430

GGA TAT AAG AGA TGG GAA GAA AAG AAA GGG AAG GCA TTG TAA 1338
Gly Tyr Lys Arg Trp Glu Glu Lys Lys Gly Lys Ala Leu
435 440 445

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

Met Ile Arg Leu Val Thr Met Gly Lys Ser Ser Glu Ala Gly Val Ser
1 5 10 15

Ser Phe Gln Ala Leu Thr Met Ser Leu Ser Gly Arg Ile Gly Val Gly
20 25 30

Asn Val Ala Gly Thr Ala Thr Gly Ile Ala Tyr Gly Gly Pro Gly Ala
35 40 45

Val Phe Trp Met Trp Val Ile Thr Phe Ile Gly Ala Ala Thr Ala Tyr
50 55 60

Val Glu Ser Thr Trp Arg Lys Phe Ile Lys Arg Asn Lys Thr Asp Asn
65 70 75 80

Thr Val Ala Val Arg Arg Ser Thr Leu Lys Lys Ala Leu Ala Gly Asn
85 90 95

Gly Leu Arg Cys Ser Arg Ala Ala Ile Ile Leu Ser Met Ala Val Leu
100 105 110

Met Pro Gly Ile Gln Ala Asn Ser Ile Ala Asp Ser Phe Ser Asn Ala
115 120 125

Phe Gly Ile Pro Lys Leu Val Thr Gly Ile Phe Val Ile Ala Val Leu
130 135 140

Gly Phe Thr Ile Phe Gly Gly Val Lys Arg Ile Ala Lys Thr Ala Glu
145 150 155 160

Ile Val Val Pro Phe Met Ala Val Gly Tyr Leu Phe Val Ala Ile Ala
165 170 175

Ile Ile Ala Ala Asn Ile Glu Lys Val Pro Asp Val Phe Gly Leu Ile
180 185 190

Phe Lys Ser Ala Phe Gly Ala Asp Gln Val Phe Gly Gly Ile Leu Gly
195 200 205

Ser Ala Val Met Trp Gly Val Lys Arg Gly Leu Tyr Ala Asn Glu Ala
210 215 220

Gly Gln Gly Thr Gly Ala His Pro Ala Ala Ala Ala Glu Val Ser His
225 230 235 240

Pro Ala Lys Gln Gly Leu Val Gln Ala Phe Ser Ile Tyr Leu Asp Val
245 250 255

Phe Leu Val Val Thr Ala Thr Ala Leu Met Ile Leu Phe Thr Gly Gln
260 265 270

Tyr Asn Val Ile Asn Glu Lys Thr Gly Glu Thr Ile Val Glu His Leu
275 280 285

Lys Gly Val Glu Pro Gly Ala Gly Tyr Thr Gln Ala Ala Val Asp Thr
290 295 300

Leu Phe Pro Gly Phe Gly Ser Ala Phe Ile Ala Ile Ala Leu Phe Phe
305 310 315 320

Phe Ala Phe Thr Thr Met Tyr Ala Tyr Tyr Tyr Ile Ala Glu Thr Asn
325 330 335

Leu Ala Tyr Leu Val Arg Ser Glu Lys Arg Gly Thr Ala Phe Phe Ala
340 345 350

Leu Lys Leu Val Phe Leu Ala Ala Thr Phe Tyr Gly Thr Val Lys Thr
355 360 365

Ala Thr Thr Ala Trp Ala Met Gly Asp Ile Gly Leu Gly Ile Met Val
370 375 380

Trp Leu Asn Leu Ile Ala Ile Leu Leu Leu Phe Lys Pro Ala Tyr Met
385 390 395 400

Ala Leu Lys Asp Tyr Glu Glu Gln Leu Lys Gln Gly Lys Asp Pro Glu
405 410 415

Phe Asn Ala Ser Lys Tyr Gly Ile Lys Asn Ala Lys Phe Trp Glu Asn
420 425 430

Gly Tyr Lys Arg Trp Glu Glu Lys Lys Gly Lys Ala Leu
435 440 445



(2) INFORMATION FOR SEQ ID NO:62: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 296 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b32)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..294

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:



ATG CAA ATA GAA CTT ACT GAT GCA GCA GTA ACT TGG TTT AAA AAT GAA 48
Met Gln Ile Glu Leu Thr Asp Ala Ala Val Thr Trp Phe Lys Asn Glu
1 5 10 15

CTT GAG TTG CCT GAA AAT AAT AAA GTG CTC GTG TTT TTT GTA AGA TAT 96
Leu Glu Leu Pro Glu Asn Asn Lys Val Leu Val Phe Phe Val Arg Tyr
20 25 30

GGT GGC GAA TTC CAA CTC AAG CAA GGA TTT AGT CCT GCT TTT ACA GTT 144
Gly Gly Glu Phe Gln Leu Lys Gln Gly Phe Ser Pro Ala Phe Thr Val
35 40 45

GAA CCA AAG GAA GAT GTT GAT ATT GGC TAT GAA CAA CAA TAT GAC GAT 192
Glu Pro Lys Glu Asp Val Asp Ile Gly Tyr Glu Gln Gln Tyr Asp Asp
50 55 60

TTA AAT GTT GTC GTA GCG GAA AAA GAT TTG TGG TAC TTT GAA GAT GAC 240
Leu Asn Val Val Val Ala Glu Lys Asp Leu Trp Tyr Phe Glu Asp Asp
65 70 75 80

CAC ATT ATT GTA AAT GTA GTT GTC ACG AAG ATG AAT TTC TTA TTC CAC 288
His Ile Ile Val Asn Val Val Val Thr Lys Met Asn Phe Leu Phe His
85 90 95

CAA ATA AC 296
Gln Ile



(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 98 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

Met Gln Ile Glu Leu Thr Asp Ala Ala Val Thr Trp Phe Lys Asn Glu
1 5 10 15

Leu Glu Leu Pro Glu Asn Asn Lys Val Leu Val Phe Phe Val Arg Tyr
20 25 30

Gly Gly Glu Phe Gln Leu Lys Gln Gly Phe Ser Pro Ala Phe Thr Val
35 40 45

Glu Pro Lys Glu Asp Val Asp Ile Gly Tyr Glu Gln Gln Tyr Asp Asp
50 55 60

Leu Asn Val Val Val Ala Glu Lys Asp Leu Trp Tyr Phe Glu Asp Asp
65 70 75 80

His Ile Ile Val Asn Val Val Val Thr Lys Met Asn Phe Leu Phe His
85 90 95

Gln Ile

(2) INFORMATION FOR SEQ ID NO:64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 433 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b85)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:

GNAACACTNC TNTTTCATTT GTGAATAATT TCACAATTAT TTTATCCTTT GGTTATGAAT 60

ATGTAAATAC CATTAATTAA TTATTCATAT TTTATCTATA ATTTGTTAAT AAATATTTAA 120

TATATTATNN TTATNCTTAA ATATTANGNG CAATTATACC ACTATATTTA TTTCTTATAC 180

TCCATTTGAT TTATAGTTGT ATTAATACAT TGACTCAAAA ACTAATTAAT CAAATATGTT 240

TTTTAGATTN ATAAAGTTGT AACTGTACTA TTTNGTAGTG TANGGTAATT TATTNGGATG 300

AAATATAATT CNNNNTACTC TATNGATTAA TCAAATATGT ATCTATCAAA ATTCGGCTTA 360

TTTATNCACT CTGACATATC CANATACGCA AAAAGACTAT NNCTACCTTG TATCGANAGA 420

AATAGTCTTT TTA 433

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 208 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10b89)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..207

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

TAC TAT AAA AGC ACC CAC TCA GTC ACT AGT TTG GGC AGT TAT TGT ATG 48
Tyr Tyr Lys Ser Thr His Ser Val Thr Ser Leu Gly Ser Tyr Cys Met
1 5 10 15

CCT ATT GAA CTC AAT GCG TAT ATT ACA ATA CCT TTT TCG CAT ATT CAT 96
Pro Ile Glu Leu Asn Ala Tyr Ile Thr Ile Pro Phe Ser His Ile His
20 25 30

ATA AGA TCT TTG CAT TCN TTA AGC TTA AAT TTT CTA TTC TTC NTT CTC 144
Ile Arg Ser Leu His Ser Leu Ser Leu Asn Phe Leu Phe Phe Xaa Leu
35 40 45

TAC GGC GGC ATA GCA TTA ATA TTA CCG GAA CTA ATC CCA GTA NCN GTA 192
Tyr Gly Gly Ile Ala Leu Ile Leu Pro Glu Leu Ile Pro Val Xaa Val
50 55 60

TTA ATT GGA TAC CCG G 208
Leu Ile Gly Tyr Pro
65



(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

Tyr Tyr Lys Ser Thr His Ser Val Thr Ser Leu Gly Ser Tyr Cys Met
1 5 10 15

Pro Ile Glu Leu Asn Ala Tyr Ile Thr Ile Pro Phe Ser His Ile His
20 25 30

Ile Arg Ser Leu His Ser Leu Ser Leu Asn Phe Leu Phe Phe Xaa Leu
35 40 45

Tyr Gly Gly Ile Ala Leu Ile Leu Pro Glu Leu Ile Pro Val Xaa Val
50 55 60

Leu Ile Gly Tyr Pro
65



(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1041 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10c30)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1038

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:

AGA AGT CAA ATC ATT ANT GGC GTC NTA TCG AGT ATA CTA TTA ACT TCA 48
Arg Ser Gln Ile Ile Xaa Gly Val Xaa Ser Ser Ile Leu Leu Thr Ser
1 5 10 15

ACT ATT TTA GCA ATT GCA TAT ATT TTA ATG TGG TTT AAC GGC CAT ATG 96
Thr Ile Leu Ala Ile Ala Tyr Ile Leu Met Trp Phe Asn Gly His Met
20 25 30

ACA CTA ACT TTG ACC TTA ACG ACA ATA ATT ACA AGC TGT TTA ACC TTA 144
Thr Leu Thr Leu Thr Leu Thr Thr Ile Ile Thr Ser Cys Leu Thr Leu
35 40 45

TTA ATA TGT AGT ATT TTT ATT AAT CCA CTT ATA CAA AAA ATT AAG CAG 192
Leu Ile Cys Ser Ile Phe Ile Asn Pro Leu Ile Gln Lys Ile Lys Gln
50 55 60

TTT AAT ATA AAA ACT AAG CAA TTT GCT AAC GGA AAT TAC GCA AGC AAT 240
Phe Asn Ile Lys Thr Lys Gln Phe Ala Asn Gly Asn Tyr Ala Ser Asn
65 70 75 80

GAT AAA ACG TTT AAT TCA CCA AAA GAA ATT TAT GAA TTA AAT CAA TCT 288
Asp Lys Thr Phe Asn Ser Pro Lys Glu Ile Tyr Glu Leu Asn Gln Ser
85 90 95

TTT AAT AAA ATG GCT TCT GAA ATT ACG CAA CAA ATG AAT CAA ATT AAA 336
Phe Asn Lys Met Ala Ser Glu Ile Thr Gln Gln Met Asn Gln Ile Lys
100 105 110

TCC GAA CAA CAA GAA AAA ACA GAA CTG ATT CAA AAC TTA GCC CAT GAT 384
Ser Glu Gln Gln Glu Lys Thr Glu Leu Ile Gln Asn Leu Ala His Asp
115 120 125

TTA AAA ACA CCT TTA GCA AGC ATT ATT TCA TAT TCT GAA GGA CTA CGT 432
Leu Lys Thr Pro Leu Ala Ser Ile Ile Ser Tyr Ser Glu Gly Leu Arg
130 135 140

GAT GGT ATA ATC ACT AAG GAT CAT GAG ATT AAA GAG TCA TAC GAC ATA 480
Asp Gly Ile Ile Thr Lys Asp His Glu Ile Lys Glu Ser Tyr Asp Ile
145 150 155 160

TTA ATT AAA CAA GCA AAC AGA TTA TCA ACA TTA TTT GAT GAT ATG ACT 528
Leu Ile Lys Gln Ala Asn Arg Leu Ser Thr Leu Phe Asp Asp Met Thr
165 170 175

CAT ATT ATC ACT TTA AAT ACA GGT AAA ACA TAT CCC CCA GAA TTA ATA 576
His Ile Ile Thr Leu Asn Thr Gly Lys Thr Tyr Pro Pro Glu Leu Ile
180 185 190

CAA CTA GAC CAA TTA CTT GTA TCA ATA TTG CAA CCA TAT GAG CAA CGT 624
Gln Leu Asp Gln Leu Leu Val Ser Ile Leu Gln Pro Tyr Glu Gln Arg
195 200 205

ATC AAA CAT GAA AAC CGC ACA TTA GAA GTG AAT TTC TGT AAC GAA ATT 672
Ile Lys His Glu Asn Arg Thr Leu Glu Val Asn Phe Cys Asn Glu Ile
210 215 220

GAT GCA TTT TAT CAA TAT CGA ACG CCA CTT GAG CGT ATT TTA ACA AAC 720
Asp Ala Phe Tyr Gln Tyr Arg Thr Pro Leu Glu Arg Ile Leu Thr Asn
225 230 235 240

TTA CTT GAT AAT GCG CTA AAA TTT TCA AAT GTT GGT AGT CGC ATT GAT 768
Leu Leu Asp Asn Ala Leu Lys Phe Ser Asn Val Gly Ser Arg Ile Asp
245 250 255

ATT AAT ATT AGT GAA AAC GAA GAT CAA GAT ACT ATC GAC ATT GCT ATT 816
Ile Asn Ile Ser Glu Asn Glu Asp Gln Asp Thr Ile Asp Ile Ala Ile
260 265 270

AGC GAT GAA GGT ATT GGC ATT ATA CCA GAA CTA CAA GAA CGT ATA TTC 864
Ser Asp Glu Gly Ile Gly Ile Ile Pro Glu Leu Gln Glu Arg Ile Phe
275 280 285

GAA CGT ACA TTC AGA GTA GAA AAC TCT CGT AAT ACA AAA ACG GGT GGT 912
Glu Arg Thr Phe Arg Val Glu Asn Ser Arg Asn Thr Lys Thr Gly Gly
290 295 300

TCT GGA TTA GGC TTA TAT ATA GCT AAT GAA CTC GCG CAA CAA AAT AAC 960
Ser Gly Leu Gly Leu Tyr Ile Ala Asn Glu Leu Ala Gln Gln Asn Asn
305 310 315 320

GCA AAA ATC AGT GTA AGC AGT GAT ATA GAT GTA GGA ACT ACG ATG ACT 1008
Ala Lys Ile Ser Val Ser Ser Asp Ile Asp Val Gly Thr Thr Met Thr
325 330 335

GTA ACA TTA CAC AAA TTA GAC ATT ACG TCA TAA 1041
Val Thr Leu His Lys Leu Asp Ile Thr Ser
340 345

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 346 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Arg Ser Gln Ile Ile Xaa Gly Val Xaa Ser Ser Ile Leu Leu Thr Ser
1 5 10 15

Thr Ile Leu Ala Ile Ala Tyr Ile Leu Met Trp Phe Asn Gly His Met
20 25 30

Thr Leu Thr Leu Thr Leu Thr Thr Ile Ile Thr Ser Cys Leu Thr Leu
35 40 45

Leu Ile Cys Ser Ile Phe Ile Asn Pro Leu Ile Gln Lys Ile Lys Gln
50 55 60

Phe Asn Ile Lys Thr Lys Gln Phe Ala Asn Gly Asn Tyr Ala Ser Asn
65 70 75 80

Asp Lys Thr Phe Asn Ser Pro Lys Glu Ile Tyr Glu Leu Asn Gln Ser
85 90 95

Phe Asn Lys Met Ala Ser Glu Ile Thr Gln Gln Met Asn Gln Ile Lys
100 105 110

Ser Glu Gln Gln Glu Lys Thr Glu Leu Ile Gln Asn Leu Ala His Asp
115 120 125

Leu Lys Thr Pro Leu Ala Ser Ile Ile Ser Tyr Ser Glu Gly Leu Arg
130 135 140

Asp Gly Ile Ile Thr Lys Asp His Glu Ile Lys Glu Ser Tyr Asp Ile
145 150 155 160

Leu Ile Lys Gln Ala Asn Arg Leu Ser Thr Leu Phe Asp Asp Met Thr
165 170 175

His Ile Ile Thr Leu Asn Thr Gly Lys Thr Tyr Pro Pro Glu Leu Ile
180 185 190

Gln Leu Asp Gln Leu Leu Val Ser Ile Leu Gln Pro Tyr Glu Gln Arg
195 200 205

Ile Lys His Glu Asn Arg Thr Leu Glu Val Asn Phe Cys Asn Glu Ile
210 215 220

Asp Ala Phe Tyr Gln Tyr Arg Thr Pro Leu Glu Arg Ile Leu Thr Asn
225 230 235 240

Leu Leu Asp Asn Ala Leu Lys Phe Ser Asn Val Gly Ser Arg Ile Asp
245 250 255

Ile Asn Ile Ser Glu Asn Glu Asp Gln Asp Thr Ile Asp Ile Ala Ile
260 265 270

Ser Asp Glu Gly Ile Gly Ile Ile Pro Glu Leu Gln Glu Arg Ile Phe
275 280 285

Glu Arg Thr Phe Arg Val Glu Asn Ser Arg Asn Thr Lys Thr Gly Gly
290 295 300

Ser Gly Leu Gly Leu Tyr Ile Ala Asn Glu Leu Ala Gln Gln Asn Asn
305 310 315 320

Ala Lys Ile Ser Val Ser Ser Asp Ile Asp Val Gly Thr Thr Met Thr
325 330 335

Val Thr Leu His Lys Leu Asp Ile Thr Ser
340 345

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 392 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10c52)"
Unknown = Xaa

Unknown for codon no. 125 = TGA (stop codon)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..390

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

GTG AAC TAT GAA TAT TTC GAA TAT TCG GTT AAT TTA GGT GGT ACC ACG 48
Val Asn Tyr Glu Tyr Phe Glu Tyr Ser Val Asn Leu Gly Gly Thr Thr
1 5 10 15

yGT CAC GTC CTT TAT ATT GAT AAG GAT GCT GGC GCT TTT TTG AAA GGA 96
Xaa His Val Leu Tyr Ile Asp Lys Asp Ala Gly Ala Phe Leu Lys Gly
20 25 30

GCG TAT AGA ATG GAT ATA TTT TAT AAA AAA ATA AAA GCA AAT GTA ACG 144
Ala Tyr Arg Met Asp Ile Phe Tyr Lys Lys Ile Lys Ala Asn Val Thr
35 40 45

CCC GAA GTT TTA GCA CAA CTT CAT TCC AAG AAG ATC ATT TTG GAA AGT 192
Pro Glu Val Leu Ala Gln Leu His Ser Lys Lys Ile Ile Leu Glu Ser
50 55 60

ACA AAT CAA CAA CAA ACT AAA GGT CGC TAT TCA GTT GTT ATT TTT GAT 240
Thr Asn Gln Gln Gln Thr Lys Gly Arg Tyr Ser Val Val Ile Phe Asp
65 70 75 80

ATT TAT GGC ACT TTA ACT TTA GAT AAT GAT GTA TTA TCA GTA AGT ACT 288
Ile Tyr Gly Thr Leu Thr Leu Asp Asn Asp Val Leu Ser Val Ser Thr
85 90 95

TTA AAA GAA TCG TAT CAA ATC ACT GAA AGA CCG TAC CAT TAT TTA ACG 336
Leu Lys Glu Ser Tyr Gln Ile Thr Glu Arg Pro Tyr His Tyr Leu Thr
100 105 110

ACT AAn ATA AAT GAA GAC TAC CAT AAT ATT CCA AGA TGA GGC AAC TTA 384
Thr Xaa Ile Asn Glu Asp Tyr His Asn Ile Pro Arg Xaa Gly Asn Leu
115 120 125

AGT CAT TA 392
Ser His
130



(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 130 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

Val Asn Tyr Glu Tyr Phe Glu Tyr Ser Val Asn Leu Gly Gly Thr Thr
1 5 10 15

Xaa His Val Leu Tyr Ile Asp Lys Asp Ala Gly Ala Phe Leu Lys Gly
20 25 30

Ala Tyr Arg Met Asp Ile Phe Tyr Lys Lys Ile Lys Ala Asn Val Thr
35 40 45

Pro Glu Val Leu Ala Gln Leu His Ser Lys Lys Ile Ile Leu Glu Ser
50 55 60

Thr Asn Gln Gln Gln Thr Lys Gly Arg Tyr Ser Val Val Ile Phe Asp
65 70 75 80

Ile Tyr Gly Thr Leu Thr Leu Asp Asn Asp Val Leu Ser Val Ser Thr
85 90 95

Leu Lys Glu Ser Tyr Gln Ile Thr Glu Arg Pro Tyr His Tyr Leu Thr
100 105 110

Thr Xaa Ile Asn Glu Asp Tyr His Asn Ile Pro Arg Xaa Gly Asn Leu
115 120 125

Ser His
130


WO 99/01473 






PCT/GB98/01974 



(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1071 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10d9)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1068

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

ATG AAA ATG AGA ACA ATT GCT AAA ACC AGT TTA GCA CTA GGG CTT TTA 48
Met Lys Met Arg Thr Ile Ala Lys Thr Ser Leu Ala Leu Gly Leu Leu
1 5 10 15

ACA ACA GGC GCA ATT ACA GTA ACG ACG CAA TCG GTC AAA GCA GAA AAA 96
Thr Thr Gly Ala Ile Thr Val Thr Thr Gln Ser Val Lys Ala Glu Lys
20 25 30

ATA CAA TCA ACT AAA GTT GAC AAA GTA CCA ACG CTT AAA GCA GAG CGA 144
Ile Gln Ser Thr Lys Val Asp Lys Val Pro Thr Leu Lys Ala Glu Arg
35 40 45

TTA GCA ATG ATA AAC ATA ACA GCA GGT GCA AAT TCA GCG ACA ACA CAA 192
Leu Ala Met Ile Asn Ile Thr Ala Gly Ala Asn Ser Ala Thr Thr Gln
50 55 60

GCA GCT AAC ACA AGA CAA GAA CGC ACG CCT AAA CTC GAA AAG GCA CCA 240
Ala Ala Asn Thr Arg Gln Glu Arg Thr Pro Lys Leu Glu Lys Ala Pro
65 70 75 80

AAT ACT AAT GAG GAA AAA ACC TCA GCT TCC AAA ATA GAA AAA ATA TCA 288
Asn Thr Asn Glu Glu Lys Thr Ser Ala Ser Lys Ile Glu Lys Ile Ser
85 90 95

CAA CCT AAA CAA GAA GAG CAG AAA ACG CTT AAT ATA TCA GCA ACG CCA 336
Gln Pro Lys Gln Glu Glu Gln Lys Thr Leu Asn Ile Ser Ala Thr Pro
100 105 110

GCG CCT AAA CAA GAA CAA TCA CAA ACG ACA ACC GAA TCC ACA ACG CCG 384
Ala Pro Lys Gln Glu Gln Ser Gln Thr Thr Thr Glu Ser Thr Thr Pro
115 120 125

AAA ACT AAA GTG ACA ACA CCT CCA TCA ACA AAC ACG CCA CAA CCA ATG 432
Lys Thr Lys Val Thr Thr Pro Pro Ser Thr Asn Thr Pro Gln Pro Met
130 135 140

CAA TCT ACT AAA TCA GAC ACA CCA CAA TCT CCA ACC ATA AAA CAA GCA 480
Gln Ser Thr Lys Ser Asp Thr Pro Gln Ser Pro Thr Ile Lys Gln Ala
145 150 155 160

CAA ACA GAT ATG ACT CCT AAA TAT GAA GAT TTA AGA GCG TAT TAT ACA 528
Gln Thr Asp Met Thr Pro Lys Tyr Glu Asp Leu Arg Ala Tyr Tyr Thr
165 170 175

AAA CCG AGT TTT GAA TTT GAA AAG CAG TTT GGA TTT ATG CTC AAA CCA 576
Lys Pro Ser Phe Glu Phe Glu Lys Gln Phe Gly Phe Met Leu Lys Pro
180 185 190

TGG ACG ACG GTT AGG TTT ATG AAT GTT ATT CCA AAT AGG TTC ATC TAT 624
Trp Thr Thr Val Arg Phe Met Asn Val Ile Pro Asn Arg Phe Ile Tyr
195 200 205

AAA ATA GCT TTA GTT GGA AAA GAT GAG AAA AAA TAT AAA GAT GGA CCT 672
Lys Ile Ala Leu Val Gly Lys Asp Glu Lys Lys Tyr Lys Asp Gly Pro
210 215 220

TAC GAT AAT ATC GAT GTA TTT ATC GTT TTA GAA GAC AAT AAA TAT CAA 720
Tyr Asp Asn Ile Asp Val Phe Ile Val Leu Glu Asp Asn Lys Tyr Gln
225 230 235 240

TTG AAA AAA TAT TCT GTC GGT GGC ATC ACG AAG ACT AAT AGT AAA AAA 768
Leu Lys Lys Tyr Ser Val Gly Gly Ile Thr Lys Thr Asn Ser Lys Lys
245 250 255

GTT AAT CAC AAA GTA GAA TTA AGC ATT ACT AAA AAA GAT AAT CAA GGT 816
Val Asn His Lys Val Glu Leu Ser Ile Thr Lys Lys Asp Asn Gln Gly
260 265 270

ATG ATT TCA CGC GAT GTT TCA GAA TAC ATG ATT ACT AAG GAA GAG ATT 864
Met Ile Ser Arg Asp Val Ser Glu Tyr Met Ile Thr Lys Glu Glu Ile
275 280 285

TCC TTG AAA GAG CTT GAT TTT AAA TTG AGA AAA CAA CTT ATT GAA AAA 912
Ser Leu Lys Glu Leu Asp Phe Lys Leu Arg Lys Gln Leu Ile Glu Lys
290 295 300

CAT AAT CTT TAC GGT AAC ATG GGT TCA GGA ACA ATC GTT ATT AAA ATG 960
His Asn Leu Tyr Gly Asn Met Gly Ser Gly Thr Ile Val Ile Lys Met
305 310 315 320

AAA AAC GGT GGG AAA TAT ACG TTT GAA TTA CAC AAA AAA CTG CAA GAG 1008
Lys Asn Gly Gly Lys Tyr Thr Phe Glu Leu His Lys Lys Leu Gln Glu
325 330 335

CAT CGT ATG GCA GAC GTC ATA GAT GGC ACT AAT ATT GAT AAC ATT GAA 1056
His Arg Met Ala Asp Val Ile Asp Gly Thr Asn Ile Asp Asn Ile Glu
340 345 350

GTG AAT ATA AAA TAA 1071
Val Asn Ile Lys
355

(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 356 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:

Met Lys Met Arg Thr Ile Ala Lys Thr Ser Leu Ala Leu Gly Leu Leu
1 5 10 15

Thr Thr Gly Ala Ile Thr Val Thr Thr Gln Ser Val Lys Ala Glu Lys
20 25 30

Ile Gln Ser Thr Lys Val Asp Lys Val Pro Thr Leu Lys Ala Glu Arg
35 40 45

Leu Ala Met Ile Asn Ile Thr Ala Gly Ala Asn Ser Ala Thr Thr Gln
50 55 60

Ala Ala Asn Thr Arg Gln Glu Arg Thr Pro Lys Leu Glu Lys Ala Pro
65 70 75 80

Asn Thr Asn Glu Glu Lys Thr Ser Ala Ser Lys Ile Glu Lys Ile Ser
85 90 95

Gln Pro Lys Gln Glu Glu Gln Lys Thr Leu Asn Ile Ser Ala Thr Pro
100 105 110

Ala Pro Lys Gln Glu Gln Ser Gln Thr Thr Thr Glu Ser Thr Thr Pro
115 120 125

Lys Thr Lys Val Thr Thr Pro Pro Ser Thr Asn Thr Pro Gln Pro Met
130 135 140

Gln Ser Thr Lys Ser Asp Thr Pro Gln Ser Pro Thr Ile Lys Gln Ala
145 150 155 160

Gln Thr Asp Met Thr Pro Lys Tyr Glu Asp Leu Arg Ala Tyr Tyr Thr
165 170 175

Lys Pro Ser Phe Glu Phe Glu Lys Gln Phe Gly Phe Met Leu Lys Pro
180 185 190

Trp Thr Thr Val Arg Phe Met Asn Val Ile Pro Asn Arg Phe Ile Tyr
195 200 205

Lys Ile Ala Leu Val Gly Lys Asp Glu Lys Lys Tyr Lys Asp Gly Pro
210 215 220

Tyr Asp Asn Ile Asp Val Phe Ile Val Leu Glu Asp Asn Lys Tyr Gln
225 230 235 240

Leu Lys Lys Tyr Ser Val Gly Gly Ile Thr Lys Thr Asn Ser Lys Lys
245 250 255

Val Asn His Lys Val Glu Leu Ser Ile Thr Lys Lys Asp Asn Gln Gly
260 265 270

Met Ile Ser Arg Asp Val Ser Glu Tyr Met Ile Thr Lys Glu Glu Ile
275 280 285

Ser Leu Lys Glu Leu Asp Phe Lys Leu Arg Lys Gln Leu Ile Glu Lys
290 295 300

His Asn Leu Tyr Gly Asn Met Gly Ser Gly Thr Ile Val Ile Lys Met
305 310 315 320

Lys Asn Gly Gly Lys Tyr Thr Phe Glu Leu His Lys Lys Leu Gln Glu
325 330 335

His Arg Met Ala Asp Val Ile Asp Gly Thr Asn Ile Asp Asn Ile Glu
340 345 350

Val Asn Ile Lys
355

(2) INFORMATION FOR SEQ ID NO:73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1590 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p11c12)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1587

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

AGA GCA AAC CCT AGA AAA GGA GGT GTT TGT GTG AAT TTA TTA AGC CTC 48
Arg Ala Asn Pro Arg Lys Gly Gly Val Cys Val Asn Leu Leu Ser Leu
1 5 10 15

CTA CTC ATT TTG CTG GGG ATC ATT CTA GGA GTT GTT GGA GGG TAT GTT 96
Leu Leu Ile Leu Leu Gly Ile Ile Leu Gly Val Val Gly Gly Tyr Val
20 25 30

GTT GCC CGA AAT TTG TTG CTT CAA AAG CAA TCA CAA GCT AGA CAA ACT 144
Val Ala Arg Asn Leu Leu Leu Gln Lys Gln Ser Gln Ala Arg Gln Thr
35 40 45

GCC GAA GAT ATT GTA AAT CAA GCA CAT AAA GAA GCT GAC AAT ATC AAA 192
Ala Glu Asp Ile Val Asn Gln Ala His Lys Glu Ala Asp Asn Ile Lys
50 55 60

AAA GAG AAA TTA CTT GAG GCA AAA GAA GAA AAC CAA ATC CTA AGA GAA 240
Lys Glu Lys Leu Leu Glu Ala Lys Glu Glu Asn Gln Ile Leu Arg Glu
65 70 75 80

CAA ACT GAA GCA GAA CTA CGA GAA AGA CGT AGC GAA CTT CAA AGA CAA 288
Gln Thr Glu Ala Glu Leu Arg Glu Arg Arg Ser Glu Leu Gln Arg Gln
85 90 95

GAA ACC CGA CTT CTT CAA AAA GAA GAA AAC TTA GAG CGC AAA TCT GAT 336
Glu Thr Arg Leu Leu Gln Lys Glu Glu Asn Leu Glu Arg Lys Ser Asp
100 105 110

CTA TTA GAT AAA AAA GAT GAG ATT TTA GAG CAA AAA GAA TCA AAA ATT 384
Leu Leu Asp Lys Lys Asp Glu Ile Leu Glu Gln Lys Glu Ser Lys Ile
115 120 125

GAA GAA AAA CAA CAA CAA GTA GAT GCA AAA GAG AGT AGT GTT CAA ACG 432
Glu Glu Lys Gln Gln Gln Val Asp Ala Lys Glu Ser Ser Val Gln Thr
130 135 140

TTA ATA ATG AAG CAT GAA CAA GAA TTA GAA CGC ATC TCC GGT CTC ACT 480
Leu Ile Met Lys His Glu Gln Glu Leu Glu Arg Ile Ser Gly Leu Thr
145 150 155 160

CAA GAA GAA GCT ATT AAT GAG CAA CTT CAA AGA GTA GAG GAA GAA CTG 528
Gln Glu Glu Ala Ile Asn Glu Gln Leu Gln Arg Val Glu Glu Glu Leu
165 170 175

TCA CAA GAT ATT GCA GTA CTT GTT AAA GAA AAA GAA AAA GAA GCT AAA 576
Ser Gln Asp Ile Ala Val Leu Val Lys Glu Lys Glu Lys Glu Ala Lys
180 185 190

GAA AAA GTT GAT AAA ACA GCA AAA GAA TTA TTA GCT ACA GCA GTA CAA 624
Glu Lys Val Asp Lys Thr Ala Lys Glu Leu Leu Ala Thr Ala Val Gln
195 200 205

AGA TTA GCA GCA GAT CAC ACA AGT GAA TCA ACG GTA TCA GTA GTT AAC 672
Arg Leu Ala Ala Asp His Thr Ser Glu Ser Thr Val Ser Val Val Asn
210 215 220

TTA CCT AAT GAT GAG ATG AAA GGT CGA ATC ATT GGA CGA GAA GGA CGA 720
Leu Pro Asn Asp Glu Met Lys Gly Arg Ile Ile Gly Arg Glu Gly Arg
225 230 235 240

AAC ATC CGC ACA CTT GAA ACT TTA ACT GGC ATT GAT TTA ATT ATT GAT 768
Asn Ile Arg Thr Leu Glu Thr Leu Thr Gly Ile Asp Leu Ile Ile Asp
245 250 255

GAC ACA CCA GAA GCG GTT ATA TTA TCT GGT TTT GAT CCA ATA AGA AGA 816
Asp Thr Pro Glu Ala Val Ile Leu Ser Gly Phe Asp Pro Ile Arg Arg
260 265 270

GAA ATT GCT AGA ACA GCA CTT GTT AAC TTA GTA TCT GAT GGA CGT ATT 864
Glu Ile Ala Arg Thr Ala Leu Val Asn Leu Val Ser Asp Gly Arg Ile
275 280 285

CAT CCA GGT AGA ATT GAA GAT ATG GTC GAA AAA GCT AGA AAA GAA GTA 912
His Pro Gly Arg Ile Glu Asp Met Val Glu Lys Ala Arg Lys Glu Val
290 295 300

GAC GAT ATT ATT AGA GAA GCA GGT GAA CAA GCT ACA TTT GAA GTG AAC 960
Asp Asp Ile Ile Arg Glu Ala Gly Glu Gln Ala Thr Phe Glu Val Asn
305 310 315 320

GCA CAT AAT ATG CAT CCT GAC TTA GTA AAA ATT GTA GGG CGT TTA AAC 1008
Ala His Asn Met His Pro Asp Leu Val Lys Ile Val Gly Arg Leu Asn
325 330 335

TAT CGT ACG AGT TAC GGT CAA AAT GTA CTT AAA CAT TCA ATT GAA GTT 1056
Tyr Arg Thr Ser Tyr Gly Gln Asn Val Leu Lys His Ser Ile Glu Val
340 345 350

GCG CAT CTT GCT AGT ATG TTA GCT GCT GAG CTA GGC GAA GAT GAG ACA 1104
Ala His Leu Ala Ser Met Leu Ala Ala Glu Leu Gly Glu Asp Glu Thr
355 360 365

TTA GCG AAA CGA GCT GGA CTT TTA CAT GAT GTT GGT AAA GCA ATT GAT 1152
Leu Ala Lys Arg Ala Gly Leu Leu His Asp Val Gly Lys Ala Ile Asp
370 375 380

CAT GAA GTA GAA GGT AGT CAT GTT GAA ATC GGT GTA GAA TTA GCG AAA 1200
His Glu Val Glu Gly Ser His Val Glu Ile Gly Val Glu Leu Ala Lys
385 390 395 400

AAA TAT GGT GAA AAT GAA ACA GTT ATT AAT GCA ATC CAT TCT CAT CAT 1248
Lys Tyr Gly Glu Asn Glu Thr Val Ile Asn Ala Ile His Ser His His
405 410 415

GGT GAT GTT GAA CCT ACA TCT ATT ATA TCT ATC CTT GTT GCT GCT GCA 1296
Gly Asp Val Glu Pro Thr Ser Ile Ile Ser Ile Leu Val Ala Ala Ala
420 425 430

GAT GCA TTG TCT GCG GCT CGT CCA GGT GCA AGA AAA GAA ACA TTA GAG 1344
Asp Ala Leu Ser Ala Ala Arg Pro Gly Ala Arg Lys Glu Thr Leu Glu
435 440 445

AAT TAT ATT CGT CGA TTA GAA CGT TTA GAA ACG TTA TCA GAA AGT TAT 1392
Asn Tyr Ile Arg Arg Leu Glu Arg Leu Glu Thr Leu Ser Glu Ser Tyr
450 455 460

GAT GGT GTA GAA AAA GCA TTT GCG ATT CAG GCA GGT AGA GAA ATC CGA 1440
Asp Gly Val Glu Lys Ala Phe Ala Ile Gln Ala Gly Arg Glu Ile Arg
465 470 475 480

GTG ATT GTA TCT CCT GAA GAA ATT GAT GAT TTA AAA TCT TAT CGA TTG 1488
Val Ile Val Ser Pro Glu Glu Ile Asp Asp Leu Lys Ser Tyr Arg Leu
485 490 495

GCT AGA GAT ATT AAA AAT CAG ATT GAA GAT GAA TTA CAA TAT CCT GGT 1536
Ala Arg Asp Ile Lys Asn Gln Ile Glu Asp Glu Leu Gln Tyr Pro Gly
500 505 510

CAT ATC AAG GTG ACA GTT GTT CGA GAG ACT AGA GCA GTA GAA TAT GCG 1584
His Ile Lys Val Thr Val Val Arg Glu Thr Arg Ala Val Glu Tyr Ala
515 520 525

AAA TAA 1590
Lys



(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 529 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Arg Ala Asn Pro Arg Lys Gly Gly Val Cys Val Asn Leu Leu Ser Leu
1 5 10 15

Leu Leu Ile Leu Leu Gly Ile Ile Leu Gly Val Val Gly Gly Tyr Val
20 25 30

Val Ala Arg Asn Leu Leu Leu Gln Lys Gln Ser Gln Ala Arg Gln Thr
35 40 45

Ala Glu Asp Ile Val Asn Gln Ala His Lys Glu Ala Asp Asn Ile Lys
50 55 60

Lys Glu Lys Leu Leu Glu Ala Lys Glu Glu Asn Gln Ile Leu Arg Glu
65 70 75 80

Gln Thr Glu Ala Glu Leu Arg Glu Arg Arg Ser Glu Leu Gln Arg Gln
85 90 95

Glu Thr Arg Leu Leu Gln Lys Glu Glu Asn Leu Glu Arg Lys Ser Asp
100 105 110

Leu Leu Asp Lys Lys Asp Glu Ile Leu Glu Gln Lys Glu Ser Lys Ile
115 120 125

Glu Glu Lys Gln Gln Gln Val Asp Ala Lys Glu Ser Ser Val Gln Thr
130 135 140

Leu Ile Met Lys His Glu Gln Glu Leu Glu Arg Ile Ser Gly Leu Thr
145 150 155 160

Gln Glu Glu Ala Ile Asn Glu Gln Leu Gln Arg Val Glu Glu Glu Leu
165 170 175

Ser Gln Asp Ile Ala Val Leu Val Lys Glu Lys Glu Lys Glu Ala Lys
180 185 190

Glu Lys Val Asp Lys Thr Ala Lys Glu Leu Leu Ala Thr Ala Val Gln
195 200 205

Arg Leu Ala Ala Asp His Thr Ser Glu Ser Thr Val Ser Val Val Asn
210 215 220

Leu Pro Asn Asp Glu Met Lys Gly Arg Ile Ile Gly Arg Glu Gly Arg
225 230 235 240

Asn Ile Arg Thr Leu Glu Thr Leu Thr Gly Ile Asp Leu Ile Ile Asp
245 250 255

Asp Thr Pro Glu Ala Val Ile Leu Ser Gly Phe Asp Pro Ile Arg Arg
260 265 270

Glu Ile Ala Arg Thr Ala Leu Val Asn Leu Val Ser Asp Gly Arg Ile
275 280 285

His Pro Gly Arg Ile Glu Asp Met Val Glu Lys Ala Arg Lys Glu Val
290 295 300

Asp Asp Ile Ile Arg Glu Ala Gly Glu Gln Ala Thr Phe Glu Val Asn
305 310 315 320

Ala His Asn Met His Pro Asp Leu Val Lys Ile Val Gly Arg Leu Asn
325 330 335

Tyr Arg Thr Ser Tyr Gly Gln Asn Val Leu Lys His Ser Ile Glu Val
340 345 350

Ala His Leu Ala Ser Met Leu Ala Ala Glu Leu Gly Glu Asp Glu Thr
355 360 365

Leu Ala Lys Arg Ala Gly Leu Leu His Asp Val Gly Lys Ala Ile Asp
370 375 380

His Glu Val Glu Gly Ser His Val Glu Ile Gly Val Glu Leu Ala Lys
385 390 395 400

Lys Tyr Gly Glu Asn Glu Thr Val Ile Asn Ala Ile His Ser His His
405 410 415

Gly Asp Val Glu Pro Thr Ser Ile Ile Ser Ile Leu Val Ala Ala Ala
420 425 430

Asp Ala Leu Ser Ala Ala Arg Pro Gly Ala Arg Lys Glu Thr Leu Glu
435 440 445

Asn Tyr Ile Arg Arg Leu Glu Arg Leu Glu Thr Leu Ser Glu Ser Tyr
450 455 460

Asp Gly Val Glu Lys Ala Phe Ala Ile Gln Ala Gly Arg Glu Ile Arg
465 470 475 480

Val Ile Val Ser Pro Glu Glu Ile Asp Asp Leu Lys Ser Tyr Arg Leu
485 490 495

Ala Arg Asp Ile Lys Asn Gln Ile Glu Asp Glu Leu Gln Tyr Pro Gly
500 505 510

His Ile Lys Val Thr Val Val Arg Glu Thr Arg Ala Val Glu Tyr Ala
515 520 525

Lys

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 799 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p11c66)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..384

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:

TCA TGG TAT TAT ATT GTT TGG AGT GAT TTG ATG AGA TTT GTC TTT GAT 48
Ser Trp Tyr Tyr Ile Val Trp Ser Asp Leu Met Arg Phe Val Phe Asp
1 5 10 15

ATT GAT GGT ACG CTT TGT TTT GAC GGC CGA TTA ATT GAC CAG ACT ATT 96
Ile Asp Gly Thr Leu Cys Phe Asp Gly Arg Leu Ile Asp Gln Thr Ile
20 25 30

ATT GAT ACA TTG TTA CAA TTA CAA CAT GAT GGT CAT GAA CTT ATA TTT 144
Ile Asp Thr Leu Leu Gln Leu Gln His Asp Gly His Glu Leu Ile Phe
35 40 45

GCA TCA GCA CGT CCG ATT CGT GAT TTG TTG CCA GTT TTA CCA TCA GTA 192
Ala Ser Ala Arg Pro Ile Arg Asp Leu Leu Pro Val Leu Pro Ser Val
50 55 60

TTT CAT CAG CAC ACA TTA ATT GGC GCA AAT GGT GCT ATG ATT TCA CAG 240
Phe His Gln His Thr Leu Ile Gly Ala Asn Gly Ala Met Ile Ser Gln
65 70 75 80

CAA TCA AAG ATT TCT GTT ATC AAA CCA ATT CAT ACT GAT ACA TAT CAT 288
Gln Ser Lys Ile Ser Val Ile Lys Pro Ile His Thr Asp Thr Tyr His
85 90 95

CAT ATC TTC AAA ATA ATT CAA AAG TAT GAG TTA GAT TAT ATT ATT GAT 336
His Ile Phe Lys Ile Ile Gln Lys Tyr Glu Leu Asp Tyr Ile Ile Asp
100 105 110

GAT GAT TGG AAT TAT GCT GCA CAA CTT GAC GCT GnA GAA CGC GAT TTT 384
Asp Asp Trp Asn Tyr Ala Ala Gln Leu Asp Ala Xaa Glu Arg Asp Phe
115 120 125

TGAGCGTTTA GATCCACATA AGCTGGCCAG TTGTATTGAT GTTGCAAATA TCGACACGCC 444

AATCAAGAkT ATTTTATTAA ATATAGACCC GGCACAAATT ACAACTATAT TAGACGAGCT 504

AGATAAATAC CATCAAGAAT TGGAAATGAT TCACCATTCA AATGAGTATA ACATTGATAT 564

AACAGCGCAA AATATTAACA AATATACTGC ATTACAATAT ATATTTGATG CAGATGTTAA 624

ATATATAGCA TTTGGTAATG ACCACAATGA TATTGTCATG TTACAACATG CTAGTAGTGG 684

CTATATTATA GGACCATCAG AAGCATACAC ACACGCAATA TTGAAACTTG ATAAAATCAA 744

ACACATCAAT AATAATGCAC AAGCTATTTG CAAAGTCTTA AAATCATATA AATAA 799

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 128 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:

Ser Trp Tyr Tyr Ile Val Trp Ser Asp Leu Met Arg Phe Val Phe Asp
1 5 10 15

Ile Asp Gly Thr Leu Cys Phe Asp Gly Arg Leu Ile Asp Gln Thr Ile
20 25 30

Ile Asp Thr Leu Leu Gln Leu Gln His Asp Gly His Glu Leu Ile Phe
35 40 45

Ala Ser Ala Arg Pro Ile Arg Asp Leu Leu Pro Val Leu Pro Ser Val
50 55 60

Phe His Gln His Thr Leu Ile Gly Ala Asn Gly Ala Met Ile Ser Gln
65 70 75 80

Gln Ser Lys Ile Ser Val Ile Lys Pro Ile His Thr Asp Thr Tyr His
85 90 95

His Ile Phe Lys Ile Ile Gln Lys Tyr Glu Leu Asp Tyr Ile Ile Asp
100 105 110

Asp Asp Trp Asn Tyr Ala Ala Gln Leu Asp Ala Xaa Glu Arg Asp Phe
115 120 125



(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 799 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p11c66)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 335..796

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:

TCATGGTATT ATATTGTTTG GAGTGATTTG ATGAGATTTG TCTTTGATAT TGATGGTACG 60

CTTTGTTTTG ACGGCCGATT AATTGACCAG ACTATTATTG ATACATTGTT ACAATTACAA 120

CATGATGGTC ATGAACTTAT ATTTGCATCA GCACGTCCGA TTCGTGATTT GTTGCCAGTT 180

TTACCATCAG TATTTCATCA GCACACATTA ATTGGCGCAA ATGGTGCTAT GATTTCACAG 240

CAATCAAAGA TTTCTGTTAT CAAACCAATT CATACTGATA CATATCATCA TATCTTCAAA 300

ATAATTCAAA AGTATGAGTT AGATTATATT ATTG ATG ATG ATT GGA ATT ATG 352
Met Met Ile Gly Ile Met
1 5

CTG CAC AAC TTG ACG CTG nAG AAC GCG ATT TTT GAG CGT TTA GAT CCA 400
Leu His Asn Leu Thr Leu Xaa Asn Ala Ile Phe Glu Arg Leu Asp Pro
10 15 20

CAT AAG CTG GCC AGT TGT ATT GAT GTT GCA AAT ATC GAC ACG CCA ATC 448
His Lys Leu Ala Ser Cys Ile Asp Val Ala Asn Ile Asp Thr Pro Ile
25 30 35

AAG AkT ATT TTA TTA AAT ATA GAC CCG GCA CAA ATT ACA ACT ATA TTA 496
Lys Xaa Ile Leu Leu Asn Ile Asp Pro Ala Gln Ile Thr Thr Ile Leu
40 45 50

GAC GAG CTA GAT AAA TAC CAT CAA GAA TTG GAA ATG ATT CAC CAT TCA 544
Asp Glu Leu Asp Lys Tyr His Gln Glu Leu Glu Met Ile His His Ser
55 60 65 70

AAT GAG TAT AAC ATT GAT ATA ACA GCG CAA AAT ATT AAC AAA TAT ACT 592
Asn Glu Tyr Asn Ile Asp Ile Thr Ala Gln Asn Ile Asn Lys Tyr Thr
75 80 85

GCA TTA CAA TAT ATA TTT GAT GCA GAT GTT AAA TAT ATA GCA TTT GGT 640
Ala Leu Gln Tyr Ile Phe Asp Ala Asp Val Lys Tyr Ile Ala Phe Gly
90 95 100

AAT GAC CAC AAT GAT ATT GTC ATG TTA CAA CAT GCT AGT AGT GGC TAT 688
Asn Asp His Asn Asp Ile Val Met Leu Gln His Ala Ser Ser Gly Tyr
105 110 115

ATT ATA GGA CCA TCA GAA GCA TAC ACA CAC GCA ATA TTG AAA CTT GAT 736
Ile Ile Gly Pro Ser Glu Ala Tyr Thr His Ala Ile Leu Lys Leu Asp
120 125 130

AAA ATC AAA CAC ATC AAT AAT AAT GCA CAA GCT ATT TGC AAA GTC TTA 784
Lys Ile Lys His Ile Asn Asn Asn Ala Gln Ala Ile Cys Lys Val Leu
135 140 145 150

AAA TCA TAT AAA TAA 799
Lys Ser Tyr Lys

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 154 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

Met Met Ile Gly Ile Met Leu His Asn Leu Thr Leu Xaa Asn Ala Ile
1 5 10 15

Phe Glu Arg Leu Asp Pro His Lys Leu Ala Ser Cys Ile Asp Val Ala
20 25 30

Asn Ile Asp Thr Pro Ile Lys Xaa Ile Leu Leu Asn Ile Asp Pro Ala
35 40 45

Gln Ile Thr Thr Ile Leu Asp Glu Leu Asp Lys Tyr His Gln Glu Leu
50 55 60

Glu Met Ile His His Ser Asn Glu Tyr Asn Ile Asp Ile Thr Ala Gln
65 70 75 80

Asn Ile Asn Lys Tyr Thr Ala Leu Gln Tyr Ile Phe Asp Ala Asp Val
85 90 95

Lys Tyr Ile Ala Phe Gly Asn Asp His Asn Asp Ile Val Met Leu Gln
100 105 110

His Ala Ser Ser Gly Tyr Ile Ile Gly Pro Ser Glu Ala Tyr Thr His
115 120 125

Ala Ile Leu Lys Leu Asp Lys Ile Lys His Ile Asn Asn Asn Ala Gln
130 135 140

Ala Ile Cys Lys Val Leu Lys Ser Tyr Lys
145 150



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) p5c34" 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1233



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

AGC TCT ACG TTC AAC ATT TTT CCA TTG GGG ATT CCA TGC TTG GGC TGT 48
Ser Ser Thr Phe Asn Ile Phe Pro Leu Gly Ile Pro Cys Leu Gly Cys
1 5 10 15

TTA TGG TGT TGT TGC GTT ACG TTG GCA TAT TCG CAA TTC CGT AAA GGT 96
Leu Trp Cys Cys Cys Val Thr Leu Ala Tyr Ser Gln Phe Arg Lys Gly
20 25 30

GAA CCA GGT TTA TTA TCT AGA ACT TTA CGT CCT CTT TTA GGT GAT AAA 144
Glu Pro Gly Leu Leu Ser Arg Thr Leu Arg Pro Leu Leu Gly Asp Lys
35 40 45

GTA GAA GGT CCT ATT GGG ATT TTT ATT GAC GTT TTA TCT GTA TTT GCG 192
Val Glu Gly Pro Ile Gly Ile Phe Ile Asp Val Leu Ser Val Phe Ala
50 55 60

ACA ATC GTT GGG GTA GCC GTT TCG TTA GGT ATG GGT GCT CTA CAA ATT 240
Thr Ile Val Gly Val Ala Val Ser Leu Gly Met Gly Ala Leu Gln Ile
65 70 75 80

AAT GGT GGT TTA CAT TAC TTG TTC AAT GTT CCA AAC AAT ACG TTT GTA 288
Asn Gly Gly Leu His Tyr Leu Phe Asn Val Pro Asn Asn Thr Phe Val
85 90 95

CAA GCG ATT ATC ATC ATT GTT GTT ACT ATC TTA TTT ATA GCA AGT GCA 336
Gln Ala Ile Ile Ile Ile Val Val Thr Ile Leu Phe Ile Ala Ser Ala
100 105 110

TGG TCT GGA TTA AGT AAA GGT ATT CAA TAC TTA AGT AAC TTG AAC ATT 384
Trp Ser Gly Leu Ser Lys Gly Ile Gln Tyr Leu Ser Asn Leu Asn Ile
115 120 125

GGT TTA GGT ACT ATT TTA ATG GTA GCT GCT TTA ATT GTT GGA CCA ACT 432
Gly Leu Gly Thr Ile Leu Met Val Ala Ala Leu Ile Val Gly Pro Thr
130 135 140

GTT CTT ATT TTA AAT ATG TTA ACT AGC TCT ACG GGT AGT TTA CTA AAC 480
Val Leu Ile Leu Asn Met Leu Thr Ser Ser Thr Gly Ser Leu Leu Asn
145 150 155 160

ACA TTC TTG TTT AAT AGT TTT GAT ACA GCA GCT TTA AAT CCT CAA AAA 528
Thr Phe Leu Phe Asn Ser Phe Asp Thr Ala Ala Leu Asn Pro Gln Lys
165 170 175

CGT GAA TGG ATG TCT TCA TGG ACA CTT TAT TAC TGG GGT TGG TGG TTA 576
Arg Glu Trp Met Ser Ser Trp Thr Leu Tyr Tyr Trp Gly Trp Trp Leu
180 185 190

AGT TGG AGT CCA TTC GTT GGA GTG TTT ATT GCA CGA GTT TCA AAA GGA 624
Ser Trp Ser Pro Phe Val Gly Val Phe Ile Ala Arg Val Ser Lys Gly
195 200 205

CGT TCA ATT AGA GAG TTC ATT TCT GGT GTC TTG CTA GTT CCA GCA ATT 672
Arg Ser Ile Arg Glu Phe Ile Ser Gly Val Leu Leu Val Pro Ala Ile
210 215 220

GTT AGT TTT GTT TGG TTT AGT GTC TTT GGT GTA TTA GGC ATC GAG ACA 720
Val Ser Phe Val Trp Phe Ser Val Phe Gly Val Leu Gly Ile Glu Thr
225 230 235 240

GGT AAG AAA CAC AAA GAA ATT TTT GAT ATG ACT CCT GAA ACA CAG CTA 768
Gly Lys Lys His Lys Glu Ile Phe Asp Met Thr Pro Glu Thr Gln Leu
245 250 255

TTT GGA GTG TTT AAT CAT GTG CCA TTT GGC ATT GTT TTA TCG TTG ATT 816
Phe Gly Val Phe Asn His Val Pro Phe Gly Ile Val Leu Ser Leu Ile
260 265 270

GCA TTA TTA TTA ATT GCA TCA TTC TTT ATT ACA TCT GCT GAC TCA GCA 864
Ala Leu Leu Leu Ile Ala Ser Phe Phe Ile Thr Ser Ala Asp Ser Ala
275 280 285



[...] TTT GTA TTA GGA ATG CAA ACA ACA TTT GGT TCA TTA AAT CCA TCT 912
Thr Phe Val Leu Gly Met Gln Thr Thr Phe Gly Ser Leu Asn Pro Ser
290 295 300

[...] ATG GTA AAA GTT GTT TGG GGA ATT TCA CAG GCC TTA ATA GCA TTT 960
Ser Met Val Lys Val Val Trp Gly Ile Ser Gln Ala Leu Ile Ala Phe
305 310 315 320

[...] CTT TTA TTA GCT GGT GGC GGT AAC GGC GCT GAA GCT TTA AAT GCG 1008
Val Leu Leu Leu Ala Gly Gly Gly Asn Gly Ala Glu Ala Leu Asn Ala
325 330 335

[...] CAA AGT GCT GCA ATT ATA [...] [...] [...] [...] TTC [...] TTT GTC GTC 1056
Ile Gln Ser Ala Ala Ile Ile Ser Ala Phe Pro Phe Ser Phe Val Val
340 345 350

[...] CTC ATG ATG GTA AGT TTC TAC AAG GAT GCG AAC CAG GAA CGT AAA 1104
Ile Leu Met Met Val Ser Phe Tyr Lys Asp Ala Asn Gln Glu Arg Lys
355 360 365

[...] CTA GGT TTA ACA TTG ACT CCG AAT AAA CAT CGC TTA CAA GAA TAT 1152
Phe Leu Gly Leu Thr Leu Thr Pro Asn Lys His Arg Leu Gln Glu Tyr
370 375 380

[...] AAG AGT CAA CAA GAA GAT TAT GAA TCT GAC ATT CTT GAA AAG CGT 1200
Ile Lys Ser Gln Gln Glu Asp Tyr Glu Ser Asp Ile Leu Glu Lys Arg
385 390 395 400

[...] TCA CGT AGA AAT ATA GAG AAA AAA GAT AAC TAA 1236
Gln Ser Arg Arg Asn Ile Glu Lys Lys Asp Asn
405 410













(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ser Ser Thr Phe Asn Ile Phe Pro Leu Gly Ile Pro Cys Leu Gly Cys
1 5 10 15

Leu Trp Cys Cys Cys Val Thr Leu Ala Tyr Ser Gln Phe Arg Lys Gly
20 25 30

Glu Pro Gly Leu Leu Ser Arg Thr Leu Arg Pro Leu Leu Gly Asp Lys
35 40 45

Val Glu Gly Pro Ile Gly Ile Phe Ile Asp Val Leu Ser Val Phe Ala
50 55 60

Thr Ile Val Gly Val Ala Val Ser Leu Gly Met Gly Ala Leu Gln Ile
65 70 75 80

Asn Gly Gly Leu His Tyr Leu Phe Asn Val Pro Asn Asn Thr Phe Val
85 90 95

Gln Ala Ile Ile Ile Ile Val Val Thr Ile Leu Phe Ile Ala Ser Ala
100 105 110



Trp Ser Gly Leu Ser Lys Gly Ile Gln Tyr Leu Ser Asn Leu Asn Ile
115 120 125

Gly Leu Gly Thr Ile Leu Met Val Ala Ala Leu Ile Val Gly Pro Thr
130 135 140

Val Leu Ile Leu Asn Met Leu Thr Ser Ser Thr Gly Ser Leu Leu Asn
145 150 155 160

Thr Phe Leu Phe Asn Ser Phe Asp Thr Ala Ala Leu Asn Pro Gln Lys
165 170 175

Arg Glu Trp Met Ser Ser Trp Thr Leu Tyr Tyr Trp Gly Trp Trp Leu
180 185 190

Ser Trp Ser Pro Phe Val Gly Val Phe Ile Ala Arg Val Ser Lys Gly
195 200 205

Arg Ser Ile Arg Glu Phe Ile Ser Gly Val Leu Leu Val Pro Ala Ile
210 215 220

Val Ser Phe Val Trp Phe Ser Val Phe Gly Val Leu Gly Ile Glu Thr
225 230 235 240

Gly Lys Lys His Lys Glu Ile Phe Asp Met Thr Pro Glu Thr Gln Leu
245 250 255

Phe Gly Val Phe Asn His Val Pro Phe Gly Ile Val Leu Ser Leu Ile
260 265 270

Ala Leu Leu Leu Ile Ala Ser Phe Phe Ile Thr Ser Ala Asp Ser Ala
275 280 285

Thr Phe Val Leu Gly Met Gln Thr Thr Phe Gly Ser Leu Asn Pro Ser
290 295 300

Ser Met Val Lys Val Val Trp Gly Ile Ser Gln Ala Leu Ile Ala Phe
305 310 315 320

Val Leu Leu Leu Ala Gly Gly Gly Asn Gly Ala Glu Ala Leu Asn Ala
325 330 335

Ile Gln Ser Ala Ala Ile Ile Ser Ala Phe Pro Phe Ser Phe Val Val
340 345 350

Ile Leu Met Met Val Ser Phe Tyr Lys Asp Ala Asn Gln Glu Arg Lys
355 360 365

Phe Leu Gly Leu Thr Leu Thr Pro Asn Lys His Arg Leu Gln Glu Tyr
370 375 380

Ile Lys Ser Gln Gln Glu Asp Tyr Glu Ser Asp Ile Leu Glu Lys Arg
385 390 395 400

Gln Ser Arg Arg Asn Ile Glu Lys Lys Asp Asn
405 410

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p10c18)"
Unknown = N

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

ATTATAGAGT TTTTCCGCTG TTTCTAAAGT TAAATTATTC ACTTTTCTTT TCCCGCTTCT 60

TAATTCACTT ATACCACCAT AAGAAACATC AGTGTCTTGA TTAATTCTAT AATTAGATAT 120

TGATYTATCA TTTAGTARTT TTTCTATTGT ATTATNAATT TCTTTAARCT GGTTCWTAAT 180

TTTNGTCNAA ATGAAAGAAT AATTTATTTT NTCTCTANGT TATATTAATC AATNAANTAA 240

TATTANAGTT GCAANTTAAG NATAGAGAGT TNATTTTTTT CCTTTANATT CCTCCTTGGT 300

CNCTNAANAT TANCCANCCT NCCCCTTATT TTTAAATTTG GTGGGTNNAA AGGGGTTNTT 360

CTNGGCCCCN TTTTTNCCCT NTTTTTTTTN ANAAAGNANC CGNAATTCTG GATCCTAACT 420

TTTANTNTTN NCTTGGGAAC TCTAATTCAA GGGNCCTCNG CTCGNCNAGC NCTAATT 477

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p12c3)"
Unknown = Xaa

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1209

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

GAA AGA AAG CGT TTT CAC ATA ACA AAG GGG GAG TTT CAA ATG AAA GTC 48
Glu Arg Lys Arg Phe His Ile Thr Lys Gly Glu Phe Gln Met Lys Val
1 5 10 15

GAA GTT TAT AAA GGA GCG CAA GGT AAA CAT AAC CTT AAA GAT TAT GAA 96
Glu Val Tyr Lys Gly Ala Gln Gly Lys His Asn Leu Lys Asp Tyr Glu
20 25 30

GAA ACA TAT AAT ACT TTT GAT TGG AAA GAC GTA GAA CAA GCA TTT TCT 144
Glu Thr Tyr Asn Thr Phe Asp Trp Lys Asp Val Glu Gln Ala Phe Ser
35 40 45

TGG AGT GAA ACT GGA AAA ATG AAC ATG GCA TAT GAA TGC ATA GAT CGC 192
Trp Ser Glu Thr Gly Lys Met Asn Met Ala Tyr Glu Cys Ile Asp Arg
50 55 60

CAT GTA GAT CAA GGA TTA GGG GAT AAA ATA GCG TTA AAT TAC AAA GAT 240
His Val Asp Gln Gly Leu Gly Asp Lys Ile Ala Leu Asn Tyr Lys Asp
65 70 75 80

GAG CAC AGA AAA GAA TCG TAT ACT TAT AAA GAT ATG CAA CGG TTA TCT 288
Glu His Arg Lys Glu Ser Tyr Thr Tyr Lys Asp Met Gln Arg Leu Ser
85 90 95

AAT AAA GCA GCG AAT GTT TTG TCT GAA CAT GCA GAA GTT GAC AAA GGT 336
Asn Lys Ala Ala Asn Val Leu Ser Glu His Ala Glu Val Asp Lys Gly
100 105 110

GAC AGA GTA TTT ATA TTT ATG TCG CGT ACA CCT GAA CTA TAT TTT GCG 384
Asp Arg Val Phe Ile Phe Met Ser Arg Thr Pro Glu Leu Tyr Phe Ala
115 120 125

TTG TTA GGT GTT TTA AAA ATT GGT GCA ATT GTT GGG CCG TTA TTT GAA 432
Leu Leu Gly Val Leu Lys Ile Gly Ala Ile Val Gly Pro Leu Phe Glu
130 135 140

GCA TTT ATG GAA AAG GCA GTT GCG GAT AGA TTA GAG AAC AGT GAA GCT 480
Ala Phe Met Glu Lys Ala Val Ala Asp Arg Leu Glu Asn Ser Glu Ala
145 150 155 160

AAA GTG TTA ATT ACT AAT AAG GCA TTG TTA CCT CGA GTA CCT GTA GAT 528
Lys Val Leu Ile Thr Asn Lys Ala Leu Leu Pro Arg Val Pro Val Asp
165 170 175

AAA TTA CCA AAC TTG AAA AAA ATT GTT GTC GTA GAT GAG GAT GTA GAA 576
Lys Leu Pro Asn Leu Lys Lys Ile Val Val Val Asp Glu Asp Val Glu
180 185 190

GAC AAT TAC ATA GAC TTC ATT AGT TTG ATG GAA ACT GCT AGC GAT GAA 624
Asp Asn Tyr Ile Asp Phe Ile Ser Leu Met Glu Thr Ala Ser Asp Glu
195 200 205

TTT GAC ATT GAA TGG TTA AAG TCG GAT GAT GGT TTG ATT TTA CAT TAT 672
Phe Asp Ile Glu Trp Leu Lys Ser Asp Asp Gly Leu Ile Leu His Tyr
210 215 220

ACA TCA GGT TCT ACT GGG CAA CCT AAA GGT GTA TTG CAT GTT CAA CAA 720
Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly Val Leu His Val Gln Gln
225 230 235 240

GCA ATG TTA GTG CAC TAT ATT TCT GGr AAA TAT GTA TTA GAT TTA CAA 768
Ala Met Leu Val His Tyr Ile Ser Gly Lys Tyr Val Leu Asp Leu Gln
245 250 255

GAA GAT GAT GTT TAT TGG TGT ACA GCA GAT CCA GGT TGG GTT ACA GGA 816
Glu Asp Asp Val Tyr Trp Cys Thr Ala Asp Pro Gly Trp Val Thr Gly
260 265 270

ACA TCT TAT GGT ATT TTT GCA CCA TGG TTA AAT GGC GCT ACA AAT TGT 864
Thr Ser Tyr Gly Ile Phe Ala Pro Trp Leu Asn Gly Ala Thr Asn Cys
275 280 285

ATA GCT GGT GGT CGC TTT TCG CCA GAA CAG TGG TAT AGT ATG ATT GAA 912
Ile Ala Gly Gly Arg Phe Ser Pro Glu Gln Trp Tyr Ser Met Ile Glu
290 295 300

GAT TTT AAA GTG ACG ATT TGG TAT ACG GCA CCA ACA GCT TTA AGA ATG 960
Asp Phe Lys Val Thr Ile Trp Tyr Thr Ala Pro Thr Ala Leu Arg Met
305 310 315 320

TTA ATG AGT GCT GGT GAC GAT ATT GTT GAG AAA TAT GAC TTG TCA TCG 1008
Leu Met Ser Ala Gly Asp Asp Ile Val Glu Lys Tyr Asp Leu Ser Ser
325 330 335

TTA CGT TCG ATT CTA TCA GTA GGT GAG CCT TTA AAT CCT GAA GTT ATA 1056
Leu Arg Ser Ile Leu Ser Val Gly Glu Pro Leu Asn Pro Glu Val Ile
340 345 350

AAA TGG GCG AAA AAA GTA TAC GGT TTA ACG GTG TTA GAT ACT TGG TGG 1104
Lys Trp Ala Lys Lys Val Tyr Gly Leu Thr Val Leu Asp Thr Trp Trp
355 360 365

ATG ACA GAA ACA GGT GGA CAT ATG ATT GTT AAC TAT CCA ACG ATG GAC 1152
Met Thr Glu Thr Gly Gly His Met Ile Val Asn Tyr Pro Thr Met Asp
370 375 380

GTn CAA GCT TGG CTC mAT GGG CAA ACC ATT ACC TGG TAT TCA AGC TGC 1200
Val Gln Ala Trp Leu Xaa Gly Gln Thr Ile Thr Trp Tyr Ser Ser Cys
385 390 395 400

AAT TAT CGA TGA 1212
Asn Tyr Arg



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
Glu Arg Lys Arg Phe His Ile Thr Lys Gly Glu Phe Gln Met Lys Val
1 5 10 15

Glu Val Tyr Lys Gly Ala Gln Gly Lys His Asn Leu Lys Asp Tyr Glu
20 25 30

Glu Thr Tyr Asn Thr Phe Asp Trp Lys Asp Val Glu Gln Ala Phe Ser
35 40 45

Trp Ser Glu Thr Gly Lys Met Asn Met Ala Tyr Glu Cys Ile Asp Arg
50 55 60

His Val Asp Gln Gly Leu Gly Asp Lys Ile Ala Leu Asn Tyr Lys Asp
65 70 75 80

Glu His Arg Lys Glu Ser Tyr Thr Tyr Lys Asp Met Gln Arg Leu Ser
85 90 95

Asn Lys Ala Ala Asn Val Leu Ser Glu His Ala Glu Val Asp Lys Gly
100 105 110

Asp Arg Val Phe Ile Phe Met Ser Arg Thr Pro Glu Leu Tyr Phe Ala
115 120 125

Leu Leu Gly Val Leu Lys Ile Gly Ala Ile Val Gly Pro Leu Phe Glu
130 135 140

Ala Phe Met Glu Lys Ala Val Ala Asp Arg Leu Glu Asn Ser Glu Ala
145 150 155 160

Lys Val Leu Ile Thr Asn Lys Ala Leu Leu Pro Arg Val Pro Val Asp
165 170 175

Lys Leu Pro Asn Leu Lys Lys Ile Val Val Val Asp Glu Asp Val Glu
180 185 190

Asp Asn Tyr Ile Asp Phe Ile Ser Leu Met Glu Thr Ala Ser Asp Glu
195 200 205

Phe Asp Ile Glu Trp Leu Lys Ser Asp Asp Gly Leu Ile Leu His Tyr
210 215 220

Thr Ser Gly Ser Thr Gly Gln Pro Lys Gly Val Leu His Val Gln Gln
225 230 235 240

Ala Met Leu Val His Tyr Ile Ser Gly Lys Tyr Val Leu Asp Leu Gln
245 250 255

Glu Asp Asp Val Tyr Trp Cys Thr Ala Asp Pro Gly Trp Val Thr Gly
260 265 270

Thr Ser Tyr Gly Ile Phe Ala Pro Trp Leu Asn Gly Ala Thr Asn Cys
275 280 285

Ile Ala Gly Gly Arg Phe Ser Pro Glu Gln Trp Tyr Ser Met Ile Glu
290 295 300

Asp Phe Lys Val Thr Ile Trp Tyr Thr Ala Pro Thr Ala Leu Arg Met
305 310 315 320

Leu Met Ser Ala Gly Asp Asp Ile Val Glu Lys Tyr Asp Leu Ser Ser
325 330 335

Leu Arg Ser Ile Leu Ser Val Gly Glu Pro Leu Asn Pro Glu Val Ile
340 345 350

Lys Trp Ala Lys Lys Val Tyr Gly Leu Thr Val Leu Asp Thr Trp Trp
355 360 365

Met Thr Glu Thr Gly Gly His Met Ile Val Asn Tyr Pro Thr Met Asp
370 375 380

Val Gln Ala Trp Leu Xaa Gly Gln Thr Ile Thr Trp Tyr Ser Ser Cys
385 390 395 400

Asn Tyr Arg



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3147 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p14b25)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..3144



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

ATG AAA ATT CTC AGC CTG CGC CTG AAA AAC CTG AAC TCA TTA AAA GGC 48
Met Lys Ile Leu Ser Leu Arg Leu Lys Asn Leu Asn Ser Leu Lys Gly
1 5 10 15

GAA TGG AAG ATT GAT TTC ACC CGC GAG CCG TTC GCC AGC AAC GGG CTG 96
Glu Trp Lys Ile Asp Phe Thr Arg Glu Pro Phe Ala Ser Asn Gly Leu
20 25 30

TTT GCT ATT ACC GGC CCA ACA GGT GCG GGG AAA ACC ACC CTG CTG GAC 144
Phe Ala Ile Thr Gly Pro Thr Gly Ala Gly Lys Thr Thr Leu Leu Asp
35 40 45

GCC ATT TGT CTG GCG CTG TAT CAC GAA ACT CCG CGT CTC TCT AAC GTT 192
Ala Ile Cys Leu Ala Leu Tyr His Glu Thr Pro Arg Leu Ser Asn Val
50 55 60

TCA CAA TCG CAA AAT GAT CTC ATG ACC CGC GAT ACC GCC GAA TGT CTG 240
Ser Gln Ser Gln Asn Asp Leu Met Thr Arg Asp Thr Ala Glu Cys Leu
65 70 75 80

GCG GAG GTG GAG TTT GAA GTG AAA GGT GAA GCG TAC CGT GCA TTC TGG 288
Ala Glu Val Glu Phe Glu Val Lys Gly Glu Ala Tyr Arg Ala Phe Trp
85 90 95

AGC CAG AAT CGG GCG CGT AAC CAA CCC GAC GGT AAT TTG CAG GTG CCA 336
Ser Gln Asn Arg Ala Arg Asn Gln Pro Asp Gly Asn Leu Gln Val Pro
100 105 110

CGC GTA GAG CTG GCG CGC TGC GCC GAC GGC AAA ATT CTC GCC GAC AAA 384
Arg Val Glu Leu Ala Arg Cys Ala Asp Gly Lys Ile Leu Ala Asp Lys
115 120 125

GTG AAA GAT AAG CTG GAA CTG ACA GCG ACG TTA ACC GGG CTG GAT TAC 432
Val Lys Asp Lys Leu Glu Leu Thr Ala Thr Leu Thr Gly Leu Asp Tyr
130 135 140

GGG CGC TTC ACC CGT TCG ATG CTG CTT TCG CAG GGG CAA TTT GCT GCC 480
Gly Arg Phe Thr Arg Ser Met Leu Leu Ser Gln Gly Gln Phe Ala Ala
145 150 155 160

TTC CTG AAT GCC AAA CCC AAA GAA CGC GCG GAA TTG CTC GAG GAG TTA 528
Phe Leu Asn Ala Lys Pro Lys Glu Arg Ala Glu Leu Leu Glu Glu Leu
165 170 175

ACC GGC ACT GAA ATC TAC GGG CAA ATC TCG GCG ATG GTT TTT GAG CAG 576
Thr Gly Thr Glu Ile Tyr Gly Gln Ile Ser Ala Met Val Phe Glu Gln
180 185 190

CAC AAA TCG GCC CGC ACA GAG CTG GAG AAG CTG CAA GCG CAG GCC AGC 624
His Lys Ser Ala Arg Thr Glu Leu Glu Lys Leu Gln Ala Gln Ala Ser
195 200 205

GGC GTC ACG TTG CTC ACG CCG GAA CAA GTG CAA TCG CTG ACA GCG AGT 672
Gly Val Thr Leu Leu Thr Pro Glu Gln Val Gln Ser Leu Thr Ala Ser
210 215 220

TTG CAG GTA CTT ACT GAC GAA GAA AAA CAG TTA ATT ACC GCG CAG CAG 720
Leu Gln Val Leu Thr Asp Glu Glu Lys Gln Leu Ile Thr Ala Gln Gln
225 230 235 240

CAA GAA CAA CAA TCG CTA AAC TGG TTA ACG CGT CAG GAC GAA TTG CAG 768
Gln Glu Gln Gln Ser Leu Asn Trp Leu Thr Arg Gln Asp Glu Leu Gln
245 250 255

CAA GAA GCC AGC CGC CGT CAG CAG GCC TTG CAA CAG GCG TTA GCC GAA 816
Gln Glu Ala Ser Arg Arg Gln Gln Ala Leu Gln Gln Ala Leu Ala Glu
260 265 270

GAA GAA AAA GCG CAA CCT CAA CTG GCG GCG CTT AGT CTG GCA CAA CCG 864
Glu Glu Lys Ala Gln Pro Gln Leu Ala Ala Leu Ser Leu Ala Gln Pro
275 280 285

GCA CGA AAT CTT CGT CCA CAC TGG GAA CGC ATC GCA GAA CAC AGC GCG 912
Ala Arg Asn Leu Arg Pro His Trp Glu Arg Ile Ala Glu His Ser Ala
290 295 300

GCG CTG GCG CAT ATT CGC CAG CAG ATT GAA GAA GTA AAT ACT CGC TTA 960
Ala Leu Ala His Ile Arg Gln Gln Ile Glu Glu Val Asn Thr Arg Leu
305 310 315 320

CAG AGC ACA ATG GCG CTT CGC GCG AGC ATT CGC CAC CAC GCG GCG AAG 1008
Gln Ser Thr Met Ala Leu Arg Ala Ser Ile Arg His His Ala Ala Lys
325 330 335

CAG TCA GCA GAA TTA CAG CAG CAG CAA CAA AGC CTG AAT ACC TGG TTA 1056
Gln Ser Ala Glu Leu Gln Gln Gln Gln Gln Ser Leu Asn Thr Trp Leu
340 345 350

CAG GAA CAC GAC CGC TTC CGT CAG TGG AAC AAC GAA CCG GCG GGT TGG 1104
Gln Glu His Asp Arg Phe Arg Gln Trp Asn Asn Glu Pro Ala Gly Trp
355 360 365

CGT GCG CAG TTC TCC CAA CAA ACC AGC GAT CGC GAG CAT CTG CGG CAA 1152
Arg Ala Gln Phe Ser Gln Gln Thr Ser Asp Arg Glu His Leu Arg Gln
370 375 380

TGG CAG CAA CAG TTA ACC CAT GCT GAG CAA AAA CTT AAT GCG CTT GCG 1200
Trp Gln Gln Gln Leu Thr His Ala Glu Gln Lys Leu Asn Ala Leu Ala
385 390 395 400

GCG ATC ACG TTG ACG TTA ACC GCC GAT GAA GTT GCT ACC GCC CTG GCG 1248
Ala Ile Thr Leu Thr Leu Thr Ala Asp Glu Val Ala Thr Ala Leu Ala
405 410 415

CAA CAT GCT GAG CAA CGC CCA CTG CGT CAG CAC CTG GTC GCG CTG CAT 1296
Gln His Ala Glu Gln Arg Pro Leu Arg Gln His Leu Val Ala Leu His
420 425 430

GGA CAG ATT GTT CCC CAA CAA AAA CGT CTG GCG CAG TTA CAG GTC GCT 1344
Gly Gln Ile Val Pro Gln Gln Lys Arg Leu Ala Gln Leu Gln Val Ala
435 440 445

ATC CAG AAT GTC ACG CAA GAA CAG ACG CAA CGT AAC GCC GCA CTT AAC 1392
Ile Gln Asn Val Thr Gln Glu Gln Thr Gln Arg Asn Ala Ala Leu Asn
450 455 460

GAA ATG CGC CAG CGT TAT AAA GAA AAG ACG CAG CAA CTT GCC GAT GTG 1440
Glu Met Arg Gln Arg Tyr Lys Glu Lys Thr Gln Gln Leu Ala Asp Val
465 470 475 480

AAA ACC ATT TGC GAG CAG GAA GCG CGC ATC AAA ACG CTG GAA GCT CAA 1488
Lys Thr Ile Cys Glu Gln Glu Ala Arg Ile Lys Thr Leu Glu Ala Gln
485 490 495

CGT GCA CAG TTA CAG GCG GGT CAG CCT TGC CCA CTT TGT GGT TCC ACC 1536
Arg Ala Gln Leu Gln Ala Gly Gln Pro Cys Pro Leu Cys Gly Ser Thr
500 505 510



AGC CAC CCG GCG GTC GAG GCG TAT CAG GCG CTG GAG CCT GGC GTT AAT 1584
Ser His Pro Ala Val Glu Ala Tyr Gln Ala Leu Glu Pro Gly Val Asn
515 520 525

CAG TCT CGA TTA CTG GCG CTG GAA AAC GAA GTT AAA AAG CTC GGT GAA 1632
Gln Ser Arg Leu Leu Ala Leu Glu Asn Glu Val Lys Lys Leu Gly Glu
530 535 540

GAA GGT GCG ACG CTA CGT GGG CAA CTG GAC GCC ATA ACA AAG CAG CTT 1680
Glu Gly Ala Thr Leu Arg Gly Gln Leu Asp Ala Ile Thr Lys Gln Leu
545 550 555 560

CAG CGT GAT GAA AAC GAA GCG CAA AGC CTC CGA CAA GAT GAG CAA GCA 1728
Gln Arg Asp Glu Asn Glu Ala Gln Ser Leu Arg Gln Asp Glu Gln Ala
565 570 575

CTT ACT CAA CAA TGG CAA GCC GTC ACG GCC AGC CTC AAT ATC ACC TTG 1776
Leu Thr Gln Gln Trp Gln Ala Val Thr Ala Ser Leu Asn Ile Thr Leu
580 585 590

CAG CCA CTG GAC GAT ATT CAA CCG TGG CTG GAT GCA CAA GAT GAG CAC 1824
Gln Pro Leu Asp Asp Ile Gln Pro Trp Leu Asp Ala Gln Asp Glu His
595 600 605

GAA CGC CAG CTG CGG TTA CTC AGC CAA CGG CAT GAA TTA CAA GGG CAG 1872
Glu Arg Gln Leu Arg Leu Leu Ser Gln Arg His Glu Leu Gln Gly Gln
610 615 620

ATT GCC GCG CAT AAT CAG CAA ATT ATC CAG TAT CAA CAG CAA ATT GAA 1920
Ile Ala Ala His Asn Gln Gln Ile Ile Gln Tyr Gln Gln Gln Ile Glu
625 630 635 640

CAA CGC CAG CAA CTA CTT TTA ACG ACA TTG ACG GGT TAT GCA CTG ACA 1968
Gln Arg Gln Gln Leu Leu Leu Thr Thr Leu Thr Gly Tyr Ala Leu Thr
645 650 655

TTG CCA CAG GAA GAT GAA GAA GAG AGC TGG TTG GCG ACA CGT CAG CAA 2016
Leu Pro Gln Glu Asp Glu Glu Glu Ser Trp Leu Ala Thr Arg Gln Gln
660 665 670

GAA GCG CAG AGC TGG CAG CAA CGC CAG AAC GAA TTA ACC GCG CTG CAA 2064
Glu Ala Gln Ser Trp Gln Gln Arg Gln Asn Glu Leu Thr Ala Leu Gln
675 680 685

AAC CGT ATT CAG CAG CTG ACG CCG ATT CTG GAA ACG TTG CCG CAA AGT 2112
Asn Arg Ile Gln Gln Leu Thr Pro Ile Leu Glu Thr Leu Pro Gln Ser
690 695 700

GAT GAA CTC CCG CAC TGC GAA GAA ACT GTG GTA TTG GAA AAC TGG CGG 2160
Asp Glu Leu Pro His Cys Glu Glu Thr Val Val Leu Glu Asn Trp Arg
705 710 715 720

CAG GTA CAT GAA CAA TGT CTC GCA TTA CAC AGC CAG CAG CAG ACG TTA 2208
Gln Val His Glu Gln Cys Leu Ala Leu His Ser Gln Gln Gln Thr Leu
725 730 735

CAG CAA CAG GAT GTT CTG GCG GCG CAA AGT CTG CAA AAA GCC CAG GCG 2256
Gln Gln Gln Asp Val Leu Ala Ala Gln Ser Leu Gln Lys Ala Gln Ala
740 745 750

CAG TTT GAC ACC GCG CTA CAG GCC AGC GTC TTT GAC GAT CAG CAG GCG 2304
Gln Phe Asp Thr Ala Leu Gln Ala Ser Val Phe Asp Asp Gln Gln Ala
755 760 765

TTC CTT GCG GCG CTA ATG GAT GAA CAA ACA CTA ACG CAG CTG GAA CAG 2352
Phe Leu Ala Ala Leu Met Asp Glu Gln Thr Leu Thr Gln Leu Glu Gln
770 775 780

CTC AAG CAG AAT CTG GAA AAC CAG CGC CGT CAG GCG CAA ACT CTG GTC 2400
Leu Lys Gln Asn Leu Glu Asn Gln Arg Arg Gln Ala Gln Thr Leu Val
785 790 795 800

ACT CAG ACA GCA GAA ACG CTG GCA CAG CAT CAA CAA CAC CGA CCT GAC 2448
Thr Gln Thr Ala Glu Thr Leu Ala Gln His Gln Gln His Arg Pro Asp
805 810 815

GAC GGG TTG GCT CTC ACT GTG ACG GTG GAG CAG ATT CAG CAA GAG TTA 2496
Asp Gly Leu Ala Leu Thr Val Thr Val Glu Gln Ile Gln Gln Glu Leu
820 825 830

GCG CAA ACT CAC CAA AAG TTG CGT GAA AAC ACC ACG AGT CAA GGC GAG 2544
Ala Gln Thr His Gln Lys Leu Arg Glu Asn Thr Thr Ser Gln Gly Glu
835 840 845

ATT CGC CAG CAG CTG AAG CAG GAT GCA GAT AAC CGT CAG CAA CAA CAA 2592
Ile Arg Gln Gln Leu Lys Gln Asp Ala Asp Asn Arg Gln Gln Gln Gln
850 855 860

ACC TTA ATG CAG CAA ATT GCT CAA ATG ACG CAG CAG GTT GAG GAC TGG 2640
Thr Leu Met Gln Gln Ile Ala Gln Met Thr Gln Gln Val Glu Asp Trp
865 870 875 880

GGA TAT CTG AAT TCG CTA ATA GGT TCC AAA GAG GGC GAT AAA TTC CGC 2688
Gly Tyr Leu Asn Ser Leu Ile Gly Ser Lys Glu Gly Asp Lys Phe Arg
885 890 895

AAG TTT GCC CAG GGG CTG ACG CTG GAT AAT TTA GTC CAT CTC GCT AAT 2736
Lys Phe Ala Gln Gly Leu Thr Leu Asp Asn Leu Val His Leu Ala Asn
900 905 910

CAG CAA CTT ACC CGG CTG CAC GGG CGC TAT CTG TTA CAG CGC AAA GCC 2784
Gln Gln Leu Thr Arg Leu His Gly Arg Tyr Leu Leu Gln Arg Lys Ala
915 920 925

AGC GAG GCG CTG GAA GTC GAG GTT GTT GAT ACC TGG CAG GCA GAT GCG 2832
Ser Glu Ala Leu Glu Val Glu Val Val Asp Thr Trp Gln Ala Asp Ala
930 935 940

GTA CGC GAT ACC CGT ACC CTT TCC GGC GGC GAA AGT TTC CTC GTT AGT 2880
Val Arg Asp Thr Arg Thr Leu Ser Gly Gly Glu Ser Phe Leu Val Ser
945 950 955 960

CTG GCG CTG GCG CTG GCG CTT TCG GAT CTG GTC AGC CAT AAA ACA CGT 2928
Leu Ala Leu Ala Leu Ala Leu Ser Asp Leu Val Ser His Lys Thr Arg
965 970 975

ATT GAC TCG CTG TTC CTT GAT GAA GGT TTT GGC ACG CTG GAT AGC GAA 2976
Ile Asp Ser Leu Phe Leu Asp Glu Gly Phe Gly Thr Leu Asp Ser Glu
980 985 990

ACG CTG GAT ACC GCC CTT GAT GCG CTG GAT GCC CTG AAC GCC AGT GGC 3024
Thr Leu Asp Thr Ala Leu Asp Ala Leu Asp Ala Leu Asn Ala Ser Gly
995 1000 1005

AAA ACC ATC GGT GTG ATT AGC CAC GTA GAA GCG ATG AAA GAG CGT ATT 3072
Lys Thr Ile Gly Val Ile Ser His Val Glu Ala Met Lys Glu Arg Ile
1010 1015 1020

CCG GTG CAG ATC AAA GTG AAA AAG ATC AAC GGC CTG GGC TAC AGC AAA 3120
Pro Val Gln Ile Lys Val Lys Lys Ile Asn Gly Leu Gly Tyr Ser Lys
1025 1030 1035 1040

CTG GAA AGT ACG TTT GCA GTG AAA TAA 3147
Leu Glu Ser Thr Phe Ala Val Lys
1045

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1048 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

Met Lys Ile Leu Ser Leu Arg Leu Lys Asn Leu Asn Ser Leu Lys Gly
1 5 10 15

Glu Trp Lys Ile Asp Phe Thr Arg Glu Pro Phe Ala Ser Asn Gly Leu
20 25 30

Phe Ala Ile Thr Gly Pro Thr Gly Ala Gly Lys Thr Thr Leu Leu Asp
35 40 45

Ala Ile Cys Leu Ala Leu Tyr His Glu Thr Pro Arg Leu Ser Asn Val
50 55 60

Ser Gln Ser Gln Asn Asp Leu Met Thr Arg Asp Thr Ala Glu Cys Leu
65 70 75 80

Ala Glu Val Glu Phe Glu Val Lys Gly Glu Ala Tyr Arg Ala Phe Trp
85 90 95

Ser Gln Asn Arg Ala Arg Asn Gln Pro Asp Gly Asn Leu Gln Val Pro
100 105 110

Arg Val Glu Leu Ala Arg Cys Ala Asp Gly Lys Ile Leu Ala Asp Lys
115 120 125

Val Lys Asp Lys Leu Glu Leu Thr Ala Thr Leu Thr Gly Leu Asp Tyr
130 135 140

Gly Arg Phe Thr Arg Ser Met Leu Leu Ser Gln Gly Gln Phe Ala Ala
145 150 155 160

Phe Leu Asn Ala Lys Pro Lys Glu Arg Ala Glu Leu Leu Glu Glu Leu
165 170 175

Thr Gly Thr Glu Ile Tyr Gly Gln Ile Ser Ala Met Val Phe Glu Gln
180 185 190

His Lys Ser Ala Arg Thr Glu Leu Glu Lys Leu Gln Ala Gln Ala Ser
195 200 205

Gly Val Thr Leu Leu Thr Pro Glu Gln Val Gln Ser Leu Thr Ala Ser
210 215 220

Leu Gln Val Leu Thr Asp Glu Glu Lys Gln Leu Ile Thr Ala Gln Gln
225 230 235 240

Gln Glu Gln Gln Ser Leu Asn Trp Leu Thr Arg Gln Asp Glu Leu Gln
245 250 255

Gln Glu Ala Ser Arg Arg Gln Gln Ala Leu Gln Gln Ala Leu Ala Glu
260 265 270

Glu Glu Lys Ala Gln Pro Gln Leu Ala Ala Leu Ser Leu Ala Gln Pro
275 280 285

Ala Arg Asn Leu Arg Pro His Trp Glu Arg Ile Ala Glu His Ser Ala
290 295 300

Ala Leu Ala His Ile Arg Gln Gln Ile Glu Glu Val Asn Thr Arg Leu
305 310 315 320

Gln Ser Thr Met Ala Leu Arg Ala Ser Ile Arg His His Ala Ala Lys
325 330 335

Gln Ser Ala Glu Leu Gln Gln Gln Gln Gln Ser Leu Asn Thr Trp Leu
340 345 350

Gln Glu His Asp Arg Phe Arg Gln Trp Asn Asn Glu Pro Ala Gly Trp
355 360 365

Arg Ala Gln Phe Ser Gln Gln Thr Ser Asp Arg Glu His Leu Arg Gln
370 375 380

Trp Gln Gln Gln Leu Thr His Ala Glu Gln Lys Leu Asn Ala Leu Ala
385 390 395 400

Ala Ile Thr Leu Thr Leu Thr Ala Asp Glu Val Ala Thr Ala Leu Ala
405 410 415

Gln His Ala Glu Gln Arg Pro Leu Arg Gln His Leu Val Ala Leu His
420 425 430

Gly Gln Ile Val Pro Gln Gln Lys Arg Leu Ala Gln Leu Gln Val Ala
435 440 445

Ile Gln Asn Val Thr Gln Glu Gln Thr Gln Arg Asn Ala Ala Leu Asn
450 455 460

Glu Met Arg Gln Arg Tyr Lys Glu Lys Thr Gln Gln Leu Ala Asp Val
465 470 475 480

Lys Thr Ile Cys Glu Gln Glu Ala Arg Ile Lys Thr Leu Glu Ala Gln
485 490 495

Arg Ala Gln Leu Gln Ala Gly Gln Pro Cys Pro Leu Cys Gly Ser Thr
500 505 510

Ser His Pro Ala Val Glu Ala Tyr Gln Ala Leu Glu Pro Gly Val Asn
515 520 525

Gln Ser Arg Leu Leu Ala Leu Glu Asn Glu Val Lys Lys Leu Gly Glu
530 535 540

Glu Gly Ala Thr Leu Arg Gly Gln Leu Asp Ala Ile Thr Lys Gln Leu
545 550 555 560

Gln Arg Asp Glu Asn Glu Ala Gln Ser Leu Arg Gln Asp Glu Gln Ala
565 570 575

Leu Thr Gln Gln Trp Gln Ala Val Thr Ala Ser Leu Asn Ile Thr Leu
580 585 590

Gln Pro Leu Asp Asp Ile Gln Pro Trp Leu Asp Ala Gln Asp Glu His
595 600 605

Glu Arg Gln Leu Arg Leu Leu Ser Gln Arg His Glu Leu Gln Gly Gln
610 615 620

Ile Ala Ala His Asn Gln Gln Ile Ile Gln Tyr Gln Gln Gln Ile Glu
625 630 635 640

Gln Arg Gln Gln Leu Leu Leu Thr Thr Leu Thr Gly Tyr Ala Leu Thr
645 650 655

Leu Pro Gln Glu Asp Glu Glu Glu Ser Trp Leu Ala Thr Arg Gln Gln
660 665 670

Glu Ala Gln Ser Trp Gln Gln Arg Gln Asn Glu Leu Thr Ala Leu Gln
675 680 685

Asn Arg Ile Gln Gln Leu Thr Pro Ile Leu Glu Thr Leu Pro Gln Ser
690 695 700

Asp Glu Leu Pro His Cys Glu Glu Thr Val Val Leu Glu Asn Trp Arg
705 710 715 720

Gln Val His Glu Gln Cys Leu Ala Leu His Ser Gln Gln Gln Thr Leu
725 730 735

Gln Gln Gln Asp Val Leu Ala Ala Gln Ser Leu Gln Lys Ala Gln Ala
740 745 750

Gln Phe Asp Thr Ala Leu Gln Ala Ser Val Phe Asp Asp Gln Gln Ala
755 760 765

Phe Leu Ala Ala Leu Met Asp Glu Gln Thr Leu Thr Gln Leu Glu Gln
770 775 780

Leu Lys Gln Asn Leu Glu Asn Gln Arg Arg Gln Ala Gln Thr Leu Val
785 790 795 800

Thr Gln Thr Ala Glu Thr Leu Ala Gln His Gln Gln His Arg Pro Asp
805 810 815

Asp Gly Leu Ala Leu Thr Val Thr Val Glu Gln Ile Gln Gln Glu Leu
820 825 830

Ala Gln Thr His Gln Lys Leu Arg Glu Asn Thr Thr Ser Gln Gly Glu
835 840 845

Ile Arg Gln Gln Leu Lys Gln Asp Ala Asp Asn Arg Gln Gln Gln Gln
850 855 860

Thr Leu Met Gln Gln Ile Ala Gln Met Thr Gln Gln Val Glu Asp Trp
865 870 875 880

Gly Tyr Leu Asn Ser Leu Ile Gly Ser Lys Glu Gly Asp Lys Phe Arg
885 890 895

Lys Phe Ala Gln Gly Leu Thr Leu Asp Asn Leu Val His Leu Ala Asn
900 905 910

Gln Gln Leu Thr Arg Leu His Gly Arg Tyr Leu Leu Gln Arg Lys Ala
915 920 925

Ser Glu Ala Leu Glu Val Glu Val Val Asp Thr Trp Gln Ala Asp Ala
930 935 940

Val Arg Asp Thr Arg Thr Leu Ser Gly Gly Glu Ser Phe Leu Val Ser
945 950 955 960

Leu Ala Leu Ala Leu Ala Leu Ser Asp Leu Val Ser His Lys Thr Arg
965 970 975

Ile Asp Ser Leu Phe Leu Asp Glu Gly Phe Gly Thr Leu Asp Ser Glu
980 985 990

Thr Leu Asp Thr Ala Leu Asp Ala Leu Asp Ala Leu Asn Ala Ser Gly
995 1000 1005

Lys Thr Ile Gly Val Ile Ser His Val Glu Ala Met Lys Glu Arg Ile
1010 1015 1020

Pro Val Gln Ile Lys Val Lys Lys Ile Asn Gly Leu Gly Tyr Ser Lys
1025 1030 1035 1040

Leu Glu Ser Thr Phe Ala Val Lys
1045

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1239 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA (genomic) (p14b74)"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1236



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

ATG ATG AAA GAA AAA GTG ATT TTT CTC GTT GAC ATG CAA TCG TTT TAT 48 
Met Met Lys Glu Lys Val Ile Phe Leu Val Asp Met Gln Ser Phe Tyr 
1 5 10 15 

GCA TCT GTA GAG AAA GCG GAA AAT CCA CAT TTG AAA AAT AGG CCC GTC 96 
Ala Ser Val Glu Lys Ala Glu Asn Pro His Leu Lys Asn Arg Pro Val 
20 25 30 

ATT GTT TCG GGT GAC CCT GAA AAA AGG GGC GGA GTC GTA TTG GCT GCC 144 
Ile Val Ser Gly Asp Pro Glu Lys Arg Gly Gly Val Val Leu Ala Ala 
35 40 45 

TGC CCG CTG GCG AAA CAA AAG GGT GTG GTG AAT GCT TCA CGG CTG TGG 192 
Cys Pro Leu Ala Lys Gln Lys Gly Val Val Asn Ala Ser Arg Leu Trp 
50 55 60 

GAG GCG CAG GAA AAG TGT CCT GAG GCT GTT GTG CTC CGG CCG CGT ATG 240 
Glu Ala Gln Glu Lys Cys Pro Glu Ala Val Val Leu Arg Pro Arg Met 
65 70 75 80 

CAG CGG TAT ATT GAT GTA TCA CTG CAA ATT ACG GCC ATT CTC GAG GAG 288 
Gln Arg Tyr Ile Asp Val Ser Leu Gln Ile Thr Ala Ile Leu Glu Glu 
85 90 95 

TAT ACA GAC CTT GTG GAG CCG TAT TCC ATC GAT GAA CAG TTC ATG GAC 336 
Tyr Thr Asp Leu Val Glu Pro Tyr Ser Ile Asp Glu Gln Phe Met Asp 
100 105 110 

ATT ACA GGC AGC CAG AAG CTG TTT GGG ACG CCG ATG GAG ATC GCG AAA 384 
Ile Thr Gly Ser Gln Lys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys 
115 120 125 

AGC ATT CAG GGC AGA ATC ATG CGG GAG ATC GGC GTT TAT GCA CGG GTC 432 
Ser Ile Gln Gly Arg Ile Met Arg Glu Ile Gly Val Tyr Ala Arg Val 
130 135 140 

GGA ATC GGC CCT AAC AAA GCG CTG GCC AAA ATT GCG TGT GAC AAT TTT 480 
Gly Ile Gly Pro Asn Lys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe 
145 150 155 160 

GCC AAA AAG AAT AAG AAC GGT ATT TTT ACC TTA ACG AAA GAA AAT ATG 528 
Ala Lys Lys Asn Lys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met 
165 170 175 

AAA ACC GAA ATG TGG CCG CTC CCG GTG GGC AGC ATG TTT GGC GTC GGG 576 
Lys Thr Glu Met Trp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly 
180 185 190 

AGC CGC ATG AAG CAT CAT TTA AAT CGA ATG GGC ATC AGC ACG ATC GGC 624 
Ser Arg Met Lys His His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly 
195 200 205 

GGG CTC GCG GCT TTT CCG CTC GAT CTT TTA AAA AAG AAA TGG GGC ATT 672 
Gly Leu Ala Ala Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile 
210 215 220 

AAC GGC CAC GTG CTG TGG ATG ACG GCA AAC GGA ATC GAC TAT TCC CCT 720 
Asn Gly His Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro 
225 230 235 240 

GTG TCA ACT TCG TCT CTG GAC GGG CAA AAG GCG ATA GGT CAT GGA ATG 768 
Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile Gly His Gly Met 
245 250 255 

ACT CTC CCG AGA GAC TAC GAA CAC TTT GAC AAA GAA ATC AAG GTC GTA 816 
Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys Val Val 
260 265 270 

TTG CTT GAG CTG AGT GAA GAG GTG TGC AGG CGA AGC CGA AAC GCC GGG 864 
Leu Leu Glu Leu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn Ala Gly 
275 280 285 

GTC ATG GGG CAG ACA GTG TCA GTG AGC TGC CGG GGT GCT GAT TTT GAT 912 
Val Met Gly Gln Thr Val Ser Val Ser Cys Arg Gly Ala Asp Phe Asp 
290 295 300 

TGG CCG ACG GGC TTC AAC CGG CAA GTG AAG CTG GCA GAG CCG ACT AAT 960 
Trp Pro Thr Gly Phe Asn Arg Gln Val Lys Leu Ala Glu Pro Thr Asn 
305 310 315 320 

TCT ACG CAG GAT GTA TAT GAG GCT GTA CGA CGG CTG TTT CTT ACA TTT 1008 
Ser Thr Gln Asp Val Tyr Glu Ala Val Arg Arg Leu Phe Leu Thr Phe 
325 330 335 

TGG GAC GGG AAA CCC GTC CGC CGC CTC GGT GTC AAT CTG TCT CAG CTC 1056 
Trp Asp Gly Lys Pro Val Arg Arg Leu Gly Val Asn Leu Ser Gln Leu 
340 345 350 

TCA TCT GAT GAC ATA TGG CAG CTC AAT TTA TTT CAG GAT TAT GCA AAG 1104 
Ser Ser Asp Asp Ile Trp Gln Leu Asn Leu Phe Gln Asp Tyr Ala Lys 
355 360 365 

AAA ATG AGC CTA GGC TAT GTG ATG GAT GGC ATT AAA AAT CGA TTC GGC 1152 
Lys Met Ser Leu Gly Tyr Val Met Asp Gly Ile Lys Asn Arg Phe Gly 
370 375 380 

GAT ACA GCA ATC ATC AGG GCG GCG TCA CTG ACA GCG GCA GGC CAG GCA 1200 
Asp Thr Ala Ile Ile Arg Ala Ala Ser Leu Thr Ala Ala Gly Gln Ala 
385 390 395 400 

TTT GAA CGT GCG GCT AAA ATA GGG GGG CAT TAT AAA TGA 1239 
Phe Glu Arg Ala Ala Lys Ile Gly Gly His Tyr Lys 
405 410 

(2) INFORMATION FOR SEQ ID NO:87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 



Met Met Lys Glu Lys Val Ile Phe Leu Val Asp Met Gln Ser Phe Tyr 
1 5 10 15 

Ala Ser Val Glu Lys Ala Glu Asn Pro His Leu Lys Asn Arg Pro Val 
20 25 30 

Ile Val Ser Gly Asp Pro Glu Lys Arg Gly Gly Val Val Leu Ala Ala 
35 40 45 

Cys Pro Leu Ala Lys Gln Lys Gly Val Val Asn Ala Ser Arg Leu Trp 
50 55 60 

Glu Ala Gln Glu Lys Cys Pro Glu Ala Val Val Leu Arg Pro Arg Met 
65 70 75 80 

Gln Arg Tyr Ile Asp Val Ser Leu Gln Ile Thr Ala Ile Leu Glu Glu 
85 90 95 

Tyr Thr Asp Leu Val Glu Pro Tyr Ser Ile Asp Glu Gln Phe Met Asp 
100 105 110 

Ile Thr Gly Ser Gln Lys Leu Phe Gly Thr Pro Met Glu Ile Ala Lys 
115 120 125 

Ser Ile Gln Gly Arg Ile Met Arg Glu Ile Gly Val Tyr Ala Arg Val 
130 135 140 

Gly Ile Gly Pro Asn Lys Ala Leu Ala Lys Ile Ala Cys Asp Asn Phe 
145 150 155 160 

Ala Lys Lys Asn Lys Asn Gly Ile Phe Thr Leu Thr Lys Glu Asn Met 
165 170 175 

Lys Thr Glu Met Trp Pro Leu Pro Val Gly Ser Met Phe Gly Val Gly 
180 185 190 

Ser Arg Met Lys His His Leu Asn Arg Met Gly Ile Ser Thr Ile Gly 
195 200 205 

Gly Leu Ala Ala Phe Pro Leu Asp Leu Leu Lys Lys Lys Trp Gly Ile 
210 215 220 

Asn Gly His Val Leu Trp Met Thr Ala Asn Gly Ile Asp Tyr Ser Pro 
225 230 235 240 

Val Ser Thr Ser Ser Leu Asp Gly Gln Lys Ala Ile Gly His Gly Met 
245 250 255 

Thr Leu Pro Arg Asp Tyr Glu His Phe Asp Lys Glu Ile Lys Val Val 
260 265 270 

Leu Leu Glu Leu Ser Glu Glu Val Cys Arg Arg Ser Arg Asn Ala Gly 
275 280 285 

Val Met Gly Gln Thr Val Ser Val Ser Cys Arg Gly Ala Asp Phe Asp 
290 295 300 

Trp Pro Thr Gly Phe Asn Arg Gln Val Lys Leu Ala Glu Pro Thr Asn 
305 310 315 320 

Ser Thr Gln Asp Val Tyr Glu Ala Val Arg Arg Leu Phe Leu Thr Phe 
325 330 335 

Trp Asp Gly Lys Pro Val Arg Arg Leu Gly Val Asn Leu Ser Gln Leu 
340 345 350 

Ser Ser Asp Asp Ile Trp Gln Leu Asn Leu Phe Gln Asp Tyr Ala Lys 
355 360 365 

Lys Met Ser Leu Gly Tyr Val Met Asp Gly Ile Lys Asn Arg Phe Gly 
370 375 380 

Asp Thr Ala Ile Ile Arg Ala Ala Ser Leu Thr Ala Ala Gly Gln Ala 
385 390 395 400 

Phe Glu Arg Ala Ala Lys Ile Gly Gly His Tyr Lys 
405 410 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (p14c13)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

TAAGGGGACT TATATAACTG TATGTTTGTG TAGTGTTTAT GTCAGTAAGC TAAATTTACA 60 

TTCATGTTAT GTYTCATTAA ACCAATTACT CACGTTTTGG TGCATATTAA ATCTTTTATA 120 

TCGATCATAC ATCTATCATC ATTTTTATTT CTAAAATACA AACTGAATAC TTTGCTAGAA 180 

TTTGTTACAG CAATCATTGC TCATGCATTT TATAAATTAC AATTAGACAA ATATGACATT 240 

TGATATCACA CACTTCAAAC ACACACATAT ATAATCAGAC ATAAATTGTT ATGCTAAGGG 300 

TTTATTCACC AAAAATATAA TACATATGGG CTTGTTTTGA GTCCATATTG AATGAATTAA 360 

AAAGTATACT CCACTCAATC ATTTACAAAT AGGTGGTGCC ACTCCNAATT TATTTTATGG 420 

CCAACCCCAA ATATGAGAAC A 441 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (p15b9)" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..906 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

ATG AAC CAA AAA GGC AGA GGG CTT GAG ATC CTC ATC AAT GAA AAA CAG 48 
Met Asn Gln Lys Gly Arg Gly Leu Glu Ile Leu Ile Asn Glu Lys Gln 
1 5 10 15 

GAC GGC CAA TGG CTG TTT TCC GTA CTC AAA ACA GCG CTC AAA GCT TCT 96 
Asp Gly Gln Trp Leu Phe Ser Val Leu Lys Thr Ala Leu Lys Ala Ser 
20 25 30 

AAA CCA GTG ATA CAA GAC TGG ATG TCC CAT CAA CAG ATA AAG GTC AAT 144 
Lys Pro Val Ile Gln Asp Trp Met Ser His Gln Gln Ile Lys Val Asn 
35 40 45 

CAC GAA TCC GTC TTA AAC AAT ATG ATT GTA AAA AAG GGA GAC CGC GTG 192 
His Glu Ser Val Leu Asn Asn Met Ile Val Lys Lys Gly Asp Arg Val 
50 55 60 

TTC ATT GAT CTT CAG GAA AGT GAA GCA TCT TCG GTC ATT CCG GAG TAT 240 
Phe Ile Asp Leu Gln Glu Ser Glu Ala Ser Ser Val Ile Pro Glu Tyr 
65 70 75 80 

GGC GAG CTT GAT ATT TTA TTT GAG GAC AAT CAT ATG CTC ATC ATC AAT 288 
Gly Glu Leu Asp Ile Leu Phe Glu Asp Asn His Met Leu Ile Ile Asn 
85 90 95 

AAA CCC GCT GGC ATC GCG ACG CAT CCG AAT GAG GAT GGG CAA ACC GGC 336 
Lys Pro Ala Gly Ile Ala Thr His Pro Asn Glu Asp Gly Gln Thr Gly 
100 105 110 

ACA CTG GCT AAT TTG ATC GCG TAT CAT TAT CAG ATA AAT GGC GAA ACA 384 
Thr Leu Ala Asn Leu Ile Ala Tyr His Tyr Gln Ile Asn Gly Glu Thr 
115 120 125 

TGT AAG GTG CGC CAC GTC CAT CGT CTT GAT CAG GAT ACA TCT GGC GCT 432 
Cys Lys Val Arg His Val His Arg Leu Asp Gln Asp Thr Ser Gly Ala 
130 135 140 

ATC GTT TTT GCC AAG CAT CGT TTG GCA CAC GCC ATC TTA GAC CAG CAG 480 
Ile Val Phe Ala Lys His Arg Leu Ala His Ala Ile Leu Asp Gln Gln 
145 150 155 160 

TTA GAG AAA AAG ACG CTG AAG CGT ACG TAT ACC GCT ATC GCT GAA GGT 528 
Leu Glu Lys Lys Thr Leu Lys Arg Thr Tyr Thr Ala Ile Ala Glu Gly 
165 170 175 

AAG CTA CGG ACG AAA AAA GGG ACA ATT AAT CCA CCG ATC GGC AGA GAC 576 
Lys Leu Arg Thr Lys Lys Gly Thr Ile Asn Pro Pro Ile Gly Arg Asp 
180 185 190 

CGC TCA CAC CCG ACA AGA CGC CGG GTT TCA CCA GGC GGG CAA ACA GCC 624 
Arg Ser His Pro Thr Arg Arg Arg Val Ser Pro Gly Gly Gln Thr Ala 
195 200 205 

GTC ACT CAT TTC AAG GTA ATG GCC AGC AAT GCG AAA GAA CGG CTG TCG 672 
Val Thr His Phe Lys Val Met Ala Ser Asn Ala Lys Glu Arg Leu Ser 
210 215 220 

CTC GTT GAA TTA GAG CTG GAA ACA GGC AGA ACA CAC CAA ATC CGT GTT 720 
Leu Val Glu Leu Glu Leu Glu Thr Gly Arg Thr His Gln Ile Arg Val 
225 230 235 240 

CAT CTG GCG AGC CTC GGC CAT CCG TTG ACA GGA GAC TCG CTT TAC GGA 768 
His Leu Ala Ser Leu Gly His Pro Leu Thr Gly Asp Ser Leu Tyr Gly 
245 250 255 

GGC GGG AGC AAG CTG CTA AAC AGG CAG GCA CTG CAC GCC AAT AAA GTA 816 
Gly Gly Ser Lys Leu Leu Asn Arg Gln Ala Leu His Ala Asn Lys Val 
260 265 270 

CAA GCG GTT CAC CCG ATA ACA GAC GAG CTC ATA GTT GCT GAA GCC CCT 864 
Gln Ala Val His Pro Ile Thr Asp Glu Leu Ile Val Ala Glu Ala Pro 
275 280 285 

TTC CCT GCT GAT ATG AAA AAC CTT TGC CGC ACA TAT TTT TCA 906 
Phe Pro Ala Asp Met Lys Asn Leu Cys Arg Thr Tyr Phe Ser 
290 295 300 

TGA 909 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Asn Gln Lys Gly Arg Gly Leu Glu Ile Leu Ile Asn Glu Lys Gln 
1 5 10 15 

Asp Gly Gln Trp Leu Phe Ser Val Leu Lys Thr Ala Leu Lys Ala Ser 
20 25 30 

Lys Pro Val Ile Gln Asp Trp Met Ser His Gln Gln Ile Lys Val Asn 
35 40 45 

His Glu Ser Val Leu Asn Asn Met Ile Val Lys Lys Gly Asp Arg Val 
50 55 60 

Phe Ile Asp Leu Gln Glu Ser Glu Ala Ser Ser Val Ile Pro Glu Tyr 
65 70 75 80 

Gly Glu Leu Asp Ile Leu Phe Glu Asp Asn His Met Leu Ile Ile Asn 
85 90 95 

Lys Pro Ala Gly Ile Ala Thr His Pro Asn Glu Asp Gly Gln Thr Gly 
100 105 110 

Thr Leu Ala Asn Leu Ile Ala Tyr His Tyr Gln Ile Asn Gly Glu Thr 
115 120 125 

Cys Lys Val Arg His Val His Arg Leu Asp Gln Asp Thr Ser Gly Ala 
130 135 140 

Ile Val Phe Ala Lys His Arg Leu Ala His Ala Ile Leu Asp Gln Gln 
145 150 155 160 

Leu Glu Lys Lys Thr Leu Lys Arg Thr Tyr Thr Ala Ile Ala Glu Gly 
165 170 175 

Lys Leu Arg Thr Lys Lys Gly Thr Ile Asn Pro Pro Ile Gly Arg Asp 
180 185 190 

Arg Ser His Pro Thr Arg Arg Arg Val Ser Pro Gly Gly Gln Thr Ala 
195 200 205 

Val Thr His Phe Lys Val Met Ala Ser Asn Ala Lys Glu Arg Leu Ser 
210 215 220 

Leu Val Glu Leu Glu Leu Glu Thr Gly Arg Thr His Gln Ile Arg Val 
225 230 235 240 

His Leu Ala Ser Leu Gly His Pro Leu Thr Gly Asp Ser Leu Tyr Gly 
245 250 255 

Gly Gly Ser Lys Leu Leu Asn Arg Gln Ala Leu His Ala Asn Lys Val 
260 265 270 

Gln Ala Val His Pro Ile Thr Asp Glu Leu Ile Val Ala Glu Ala Pro 
275 280 285 

Phe Pro Ala Asp Met Lys Asn Leu Cys Arg Thr Tyr Phe Ser 
290 295 300 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (p15b32)" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..468 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

ATG GAT GAG TTG CAG CAG GTC TCC GAT GGC TGG CTT GCG CAC CAC AAT 48 
Met Asp Glu Leu Gln Gln Val Ser Asp Gly Trp Leu Ala His His Asn 
1 5 10 15 



ACG CGG GAA AAG CGC TTC TCG CTT GGC GCT TTC GAA CCG GAC TAT ATT 96 
Thr Arg Glu Lys Arg Phe Ser Leu Gly Ala Phe Glu Pro Asp Tyr Ile 
20 25 30 

CTG TCG CAA CCC GTC GCC GTG CTG CGC AAG GAT GGA AAA ATC ACC GCC 144 
Leu Ser Gln Pro Val Ala Val Leu Arg Lys Asp Gly Lys Ile Thr Ala 
35 40 45 

TTC GCC AAT CTG ATG GTG ACG GAG ACG AAA AAG GAA GCC ACC ATC GAC 192 
Phe Ala Asn Leu Met Val Thr Glu Thr Lys Lys Glu Ala Thr Ile Asp 
50 55 60 

CTC ATG CGC TTT TCG GCG GAT GCG CGC GCG GCT CGA TGG ATT TCC TCT 240 
Leu Met Arg Phe Ser Ala Asp Ala Arg Ala Ala Arg Trp Ile Ser Ser 
65 70 75 80 

TCG TCA GCA TCA TGC AGC ATC TGC GCG AGG CGG GAT ATG AAA GCT TCA 288 
Ser Ser Ala Ser Cys Ser Ile Cys Ala Arg Arg Asp Met Lys Ala Ser 
85 90 95 

ATC TCG GCA TGG CGC CCA TGT CCG GCA TGT CGA AGC GCG ATG CCG CGC 336 
Ile Ser Ala Trp Arg Pro Cys Pro Ala Cys Arg Ser Ala Met Pro Arg 
100 105 110 

CGG TCT GGG ACC GTA TCG GCA GCA CGC TGT TCG AGC ACG GCG AAC GTT 384 
Arg Ser Gly Thr Val Ser Ala Ala Arg Cys Ser Ser Thr Ala Asn Val 
115 120 125 

TCT ACA ACT TCA AGG GAC TTC GCG CCT TCA AGG CAA AGT TCC ACC CGA 432 
Ser Thr Thr Ser Arg Asp Phe Ala Pro Ser Arg Gln Ser Ser Thr Arg 
130 135 140 

AAT GGG AAC CCC GTT ACC TTG CTG TGC AGA ACG GCG TGA 471 
Asn Gly Asn Pro Val Thr Leu Leu Cys Arg Thr Ala 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Met Asp Glu Leu Gln Gln Val Ser Asp Gly Trp Leu Ala His His Asn 
1 5 10 15 

Thr Arg Glu Lys Arg Phe Ser Leu Gly Ala Phe Glu Pro Asp Tyr Ile 
20 25 30 

Leu Ser Gln Pro Val Ala Val Leu Arg Lys Asp Gly Lys Ile Thr Ala 
35 40 45 

Phe Ala Asn Leu Met Val Thr Glu Thr Lys Lys Glu Ala Thr Ile Asp 
50 55 60 

Leu Met Arg Phe Ser Ala Asp Ala Arg Ala Ala Arg Trp Ile Ser Ser 
65 70 75 80 

Ser Ser Ala Ser Cys Ser Ile Cys Ala Arg Arg Asp Met Lys Ala Ser 
85 90 95 

Ile Ser Ala Trp Arg Pro Cys Pro Ala Cys Arg Ser Ala Met Pro Arg 
100 105 110 

Arg Ser Gly Thr Val Ser Ala Ala Arg Cys Ser Ser Thr Ala Asn Val 
115 120 125 

Ser Thr Thr Ser Arg Asp Phe Ala Pro Ser Arg Gln Ser Ser Thr Arg 
130 135 140 

Asn Gly Asn Pro Val Thr Leu Leu Cys Arg Thr Ala 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (genomic) (p15c4)" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..189 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

ATG CCT ATT GCC CAG ATC CAC ATC CTT GAA GGC CGC AGC GAC GAG CAG 48 
Met Pro Ile Ala Gln Ile His Ile Leu Glu Gly Arg Ser Asp Glu Gln 
1 5 10 15 

AAG GAA ACC CTG ATT CGG GAA GTC AGT GAG GCC ATC TCG CGC TCC CTG 96 
Lys Glu Thr Leu Ile Arg Glu Val Ser Glu Ala Ile Ser Arg Ser Leu 
20 25 30 

GAT GCG CCG CTG ACC AGC GTG CGA GTG ATT ATC ACG GAG ATG GCC AAG 144 
Asp Ala Pro Leu Thr Ser Val Arg Val Ile Ile Thr Glu Met Ala Lys 
35 40 45 

GGC CAC TTC GGC ATC GGC GGC GAA CTG GCC AGC AAG GTC AGA CGC 189 
Gly His Phe Gly Ile Gly Gly Glu Leu Ala Ser Lys Val Arg Arg 
50 55 60 

TGA 192 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Met Pro Ile Ala Gln Ile His Ile Leu Glu Gly Arg Ser Asp Glu Gln 
1 5 10 15 

Lys Glu Thr Leu Ile Arg Glu Val Ser Glu Ala Ile Ser Arg Ser Leu 
20 25 30 

Asp Ala Pro Leu Thr Ser Val Arg Val Ile Ile Thr Glu Met Ala Lys 
35 40 45 

Gly His Phe Gly Ile Gly Gly Glu Leu Ala Ser Lys Val Arg Arg 
50 55 60 
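
As a cross-check on the two listings above, the coding region of SEQ ID NO: 93 (positions 1..189) can be translated and compared with SEQ ID NO: 94. The following is a minimal Python sketch and is not part of the original disclosure. The names `CODON_TABLE`, `SEQ_93` and `translate` are illustrative, and the standard genetic code is assumed because the listing does not name a translation table.

```python
from itertools import product

# Standard genetic code with codons ordered T, C, A, G at each position
# ("*" marks a stop codon). This table is an assumption of the sketch.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID NO: 93 as printed in the listing (192 bases including the TGA stop).
SEQ_93 = (
    "ATG CCT ATT GCC CAG ATC CAC ATC CTT GAA GGC CGC AGC GAC GAG CAG "
    "AAG GAA ACC CTG ATT CGG GAA GTC AGT GAG GCC ATC TCG CGC TCC CTG "
    "GAT GCG CCG CTG ACC AGC GTG CGA GTG ATT ATC ACG GAG ATG GCC AAG "
    "GGC CAC TTC GGC ATC GGC GGC GAA CTG GCC AGC AAG GTC AGA CGC TGA"
).replace(" ", "")


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    protein = translate(SEQ_93)
    print(len(SEQ_93), "bases ->", len(protein), "residues")
    print(protein)
```

The one-letter output corresponds residue for residue to the three-letter listing of SEQ ID NO: 94 (M = Met, P = Pro, I = Ile, and so on), and its length matches the 63 amino acids stated in the sequence characteristics.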



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
GGAGCTCACT AGTCGGAGGC ATCAGTGACC 30 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GGGATCCCAT GAGAATTCTT GAAGACG 27 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CTAGGTACCT ACAACCTCAA GCTTNKNKNK NKNKNKNKNK NKNKNKNKNK NKNKNKNKNK 60 
NKNKAAGCTT GGTTAGAATG GGTACCATG 89 
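
SEQ ID NO: 97 contains a run of IUPAC ambiguity codes, where N denotes any base and K denotes G or T. The following Python sketch, which is not part of the original disclosure, counts the degenerate positions and the number of distinct sequences the oligonucleotide could represent. The names `IUPAC`, `SEQ_97`, `degenerate_positions` and `complexity` are illustrative, and the count assumes every ambiguous position varies independently.

```python
# Subset of the IUPAC nucleotide codes that occur in SEQ ID NO: 97.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "K": "GT", "N": "ACGT"}

# SEQ ID NO: 97 as printed in the listing (89 bases).
SEQ_97 = (
    "CTAGGTACCT ACAACCTCAA GCTTNKNKNK NKNKNKNKNK NKNKNKNKNK NKNKNKNKNK "
    "NKNKAAGCTT GGTTAGAATG GGTACCATG"
).replace(" ", "")


def degenerate_positions(seq: str) -> list[int]:
    """Return the 0-based indices whose code stands for more than one base."""
    return [i for i, code in enumerate(seq) if len(IUPAC[code]) > 1]


def complexity(seq: str) -> int:
    """Count the distinct sequences, assuming positions vary independently."""
    total = 1
    for code in seq:
        total *= len(IUPAC[code])
    return total


if __name__ == "__main__":
    positions = degenerate_positions(SEQ_97)
    print("length:", len(SEQ_97))
    print("degenerate positions:", len(positions))
    print("NK repeats:", SEQ_97.count("NK"))
    print("distinct sequences:", complexity(SEQ_97))
```

Each NK pair admits 4 x 2 = 8 combinations, so the 20 consecutive pairs give 8^20 possible variable regions between the two fixed flanks.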



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CTAGAATTCT ACAACCTCAA GCTT 24 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AAGCTTGGTT AGAATGGAAT TCATG 25 

(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GAATTCCATT CTAAC 15 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
ATTCCATTCT AACCAAGC 18 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
TGAACTGCCA CTGTAGAGAG A 21 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GGCCATCGAT AATGAAATTA ATTAACGAGA GACAAATAGG 40 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GGCCGGATCC CTAGTGATGG TGATGGTGAT GAAAAATTCT GTCTTTAACT TTTTT 55 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GGCCGGTACC AAATTAATTA ACGAGAGACA AATAGG 36 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GGCCGGATCC CTAGTGATGG TGATGGTGAT GAAAAATTCT GTCTTTAACT TTTTT 55 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
TGGAGATCTA AGCTTTGCAT AACTTTCTCG TCC 33 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TCCTGGCGAT TCTGAGAC 18 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TGGGGATCCG ATAAGTGTGA CTGGTAG 27 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TGGAAGCTTA CATTACTTCA AATAAATTA 29 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TGGGGATCCT GCATATCAAA ATGTTTATGG C 31 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
TGGAAGCTTA CACATATGCC AATCTCAC 28 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
GTTGGATCCG CTGTTGTTAC TTTGATGC 28 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
TGGAAGCTTA CATTACTTCA AATAAATTA 29 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GTTGGATCCG CTGTTGTTAC TTTGATGC 28 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
TGGAAGCTTA CACATATGCC AATCTCAC 28 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
ATTTGATATG TCTCAACTGC 20 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
GCTCTAATTT TTAAGTGAGG 20 

(2) INFORMATION FOR SEQ ID NO: 119: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TATCTGGTGG CGTAACACCT G 21 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
GATGAAGCCG TTACGTTGTT C 21 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
GCCATAAGGA TGTGAATGTA TG 22 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

GCATTTGCTA GTTATCTTG 19 
(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



AGATCTATCA AGGATGTGAT GGTT 24 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
GTCATTATAC GATTTAGTAC AATC 24 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Met Lys Asp Glu Gln Leu Tyr Tyr Phe Glu Lys Ser Pro Val Phe Lys 
1 5 10 15 

Ala Met Met His Phe Ser Leu Pro Met Met Ile Gly Thr Leu Leu Ser 
20 25 30 

Val Ile Tyr Gly Ile Leu Asn Tyr Ile Phe Ile Gly Phe Ser Glu Glu 
35 40 45 

Ser His Met Ile Ser Ala Ile Ser Leu Thr Leu Pro Val Phe Ala Ile 
50 55 60 

Leu Met Gly Leu Gly Asn Leu Phe Gly Val Gly Ala Gly Thr Tyr Ile 
65 70 75 80 

Ser Arg Leu Leu Gly Ala Lys Asp Tyr Ser Lys Ser Lys Phe Val Ser 
85 90 95 

Ser Phe Ser Ile Tyr Gly Gly Ile Ala Leu Gly Leu Ile Val Ile Leu 
100 105 110 

Val Thr Leu Pro Phe Ser Asp Gln Ser Gln Gln Phe 
115 120 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
GTGTGACTGG TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 60 

AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 120 

TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 180 

GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTTTTAGAA 240 

GATAGCCACA TGATTTCTGC TATCTCTCTA ACACTGCCAG TATTTGCTAT CTTAATGGGG 300 

TTAGGTAATT TATTTGGCGT TGGTGCAGGA ACTTATATTT CACGTTTATT AGGTGCGAAA 360 

GACTATAGTA AGAGTAAATT TGTAAGTAGT TTCTCTATTT ATGGTGGTAT TGCACTAGGA 420 

CTTATCGTGA TTTTAGTTAC TTTACCATTC AGTGATCAAA TCGCAGCAAT TTTAGGGGCG 480 

AGAGGTGAAA CGTTAGCTTT AACAAGTAAT TATTTGAAAG TAATGTTTTT AAGTGCACCT 540 

TTTGTAATTT TGTTCTTCAT ATTAGAACAA TTTGCACGTG CAATTGGGGC ACCAATGGTT 600 

TCTATGATTG GTATGTTAGC TAGTGTAGGC TTAAATATTA TTTTAGATCC AATTTTAATT 660 

TTTGGTTTTG ATTTAAACGT TGTTGGTGCA GCTTTGGGTA CTGCAATCAG TAATGTTGCT 720 

GCTGCTCTGT TCTTTATCAT TTATTTTATG AAAAATAGTG ACGTTGTGTC AGTTAATATT 780 

AAACTTGCGA AACCTAATAA AGAAATGCTT TCTGAAATCT TTAAAATCGG TATTCCTGCA 840 

TTTTTAATGA GTATCTTAAT GGGATTCACA GGATTAGTTT TAAATTTATT TTTAGCACAT 900 

TATGGAAACT TCGCGATTGC AAGTTATGGT ATCTCATTTA GACTTGTGCA ATTTCCAGAA 960 

CTTATTATCA TGGGATTATG TGAAGGTGTT GTACCACTAA TTGCATATAA CTTTATGGCA 1020 

AATAAAGGCC GTATGAAAGA CGTTATCAAA GCAGTTATCA TGTCTATCGG CGTTATCTTT 1080 

GTTGTATGTA TGAGTGCTGT ATTTACAATT GGACATCATA TGGTCGGACT ATTTACTACT 1140 

GATCAAGCCA TTGTTGAGAT GGCGACATTT ATTTTGAAAG TAACAATGGC ATCATTATTA 1200 

TTAAATGGTA TAGGTTTCTT GTTTACTGGT ATGCTTCAAG CGACTGGGCA AGGTCGTGGT 1260 

GCTACAATTA TGGCCATTTT ACAAGGTGCA ATTATCATTC CAGTATTATT TATTATGAAT 1320 

GCTTTGTTTG GACTAACAGG TGTCATTTGG TCATTATTAA TTGCTGAGTC ACTTTGTGCT 1380 

TTAGCAGCAA TGTTAATCGT CTATTTATTA CGTGATCGTT TGACAGTTGA TACATCTGAA 1440 

TTAATAGAAG GTTAAATATT TCGTCCACTT CTGGCTGAGT ATATTTCGGT CGGAAGTGTA 1500 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Met Lys Asp Glu Gin Leu Tyr Tyr Phe Glu Lys Ser Pro Val Phe Lys 
15 10 15 

Ala Met Met His Phe Ser Leu Pro Met Met He Gly Thr Leu Leu Ser 
20 25 30 

Val He Tyr Gly He Leu Asn He Tyr Phe He Gly Phe Leu Glu Asp 
35 40 45 

Ser His Met He Ser Ala He Ser Leu Thr Leu Pro Val Phe Ala He 
50 55 60 

Leu Met Gly Leu Gly Asn Leu Phe Gly Val Gly Ala Gly Thr Tyr lie 
65 70 75 80 

Ser Arg Leu Leu Gly Ala Lys Asp Tyr Ser Lys Ser Lys Phe Val Ser 
85 90 95 

Ser Phe Ser He Tyr Gly Gly lie Ala Leu Gly Leu lie Val lie Leu 
100 105 HO 

Val Thr Leu Pro Phe Ser Asp Gin lie Ala Ala lie Leu Gly Ala Arg 
115 120 125 

Gly Glu Thr Leu Ala Leu Thr Ser Asn Tyr Leu Lys Val Met Phe Leu 
130 135 140 

Ser Ala Pro Phe Val lie Leu Phe Phe lie Leu Glu Gin Phe Ala Arg 
145 150 155 160 

Ala lie Gly Ala Pro Met Val Ser Met lie Gly Met Leu Ala Ser Val 
165 170 175 

Gly Leu Asn lie lie Leu Asp Pro lie Leu lie Phe Gly Phe Asp Leu 
180 185 190 

Asn Val Val Gly Ala Ala Leu Gly Thr Ala lie Ser Asn Val Ala Ala 
195 200 205 

Ala Leu Phe Phe lie lie Tyr Phe Met Lys Asn Ser Asp Val Val Ser 
210 215 220 
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Val Asn lie Lys Leu Ala Lys Pro Asn Lys Glu Met Leu Ser Glu lie 
225 230 235 240 

Phe Lys lie Gly lie Pro Ala Phe Leu Met Ser lie Leu Met Gly Phe 
245 250 255 

Thr Gly Leu Val Leu Asn Leu Phe Leu Ala His Tyr Gly Asn Phe Ala 
260 265 270 

Ile Ala Ser Tyr Gly Ile Ser Phe Arg Leu Val Gln Phe Pro Glu Leu
275 280 285

Ile Ile Met Gly Leu Cys Glu Gly Val Val Pro Leu Ile Ala Tyr Asn
290 295 300

Phe Met Ala Asn Lys Gly Arg Met Lys Asp Val Ile Lys Ala Val Ile
305 310 315 320

Met Ser Ile Gly Val Ile Phe Val Val Cys Met Ser Ala Val Phe Thr
325 330 335

Ile Gly His His Met Val Gly Leu Phe Thr Thr Asp Gln Ala Ile Val
340 345 350

Glu Met Ala Thr Phe Ile Leu Lys Val Thr Met Ala Ser Leu Leu Leu
355 360 365

Asn Gly Ile Gly Phe Leu Phe Thr Gly Met Leu Gln Ala Thr Gly Gln
370 375 380

Gly Arg Gly Ala Thr Ile Met Ala Ile Leu Gln Gly Ala Ile Ile Ile
385 390 395 400

Pro Val Leu Phe Ile Met Asn Ala Leu Phe Gly Leu Thr Gly Val Ile
405 410 415

Trp Ser Leu Leu Ile Ala Glu Ser Leu Cys Ala Leu Ala Ala Met Leu
420 425 430

Ile Val Tyr Leu Leu Arg Asp Arg Leu Thr Val Asp Thr Ser Glu Leu
435 440 445

Ile Glu Gly
450 
WHAT IS CLAIMED IS:

1. A method of identifying an anti-bacterial agent comprising the
steps of assaying potential agents for the ability to interfere with expression of
virulence gene products represented by the DNA sequences set forth in any one
of SEQ ID NOS: 1 through 94, and identifying an agent that interferes with
expression of said virulence gene products.

2. A method of identifying an anti-bacterial agent comprising the
steps of assaying potential agents for the ability to interfere with the function of
a bacterial protein encoded in whole or in part by a DNA sequence set forth in
any one of SEQ ID NOS: 1 through 94 or the complementary strand thereof,
and identifying an agent that interferes with the function of said bacterial
protein.

3. The method of claim 2 wherein the protein is involved in
cell surface metabolism and is encoded by the DNA sequence of any one of SEQ
ID NOS: 1, 6, 60 and 65 (corresponding to signature tag identification numbers
P2C73, P11C29, P13C83, P9B65 and P10B89).

4. The method of claim 2 wherein the protein is an enzyme
involved in a cellular biosynthetic pathway and is encoded by the DNA sequence
of any one of SEQ ID NOS: 4, 11, 13, 22, 25, 27, 29, 31, 33, 35 and 82
(corresponding to signature tag identification numbers P9B74, P5C4, P9B66,
P14C15, P13B26, P7C18, P15C31, P10B18, P6B18, P10B66, P10C34 and
P12C3).

5. The method of claim 2 wherein the protein is a component 
of the TCA cycle and is encoded by the DNA sequence of any one of SEQ ID 
NOS: 37, 39, 41, 43 and 44 (corresponding to signature tag identification 
numbers P4C27, P4C52, P10B2, P10C20 and P12C32).

6. The method of claim 2 wherein the protein is an
oligopeptide transport protein and is encoded by the DNA sequence of any one
of SEQ ID NOS: 15, 17, 19, 54, 75, 77 and 79 (corresponding to signature tag
identification numbers P7C26, P10C15, P5C3, P11C66 and P5C34).

7. The method of claim 2 wherein the protein is involved in 
cellular regulatory and repair processes and is encoded by the DNA sequence of 
any one of SEQ ID NOS: 8, 21, 46 and 84 (corresponding to signature tag 
identification numbers P4C15, P6C63, P13B74, P13C72, P10B30 and P14B25). 

8. The method of any one of claims 1 through 7 wherein the 
assay used is an enzyme assay. 

9. The method of any one of claims 1 through 7 wherein the
assay used is a binding assay.

10. The method of any one of claims 1 through 7 wherein the 
assay used is a two-hybrid screening assay. 

11. The method of any one of claims 1 through 7 wherein the
assay is being used to screen for a ligand by binding to target protein.

12. The method of any one of claims 1 through 7 wherein the
assay used is an affinity ultrafiltration assay.

13. A Staphylococcus aureus organism containing a functional 
mutation in a gene represented by any one of SEQ ID NOS: 1 through 94, said 
functional mutation resulting in a reduction in virulence of the organism. 

14. A vaccine composition comprising the S. aureus organism
of claim 13.